CONSTRUCTION OF SYNTHETIC DNA AND ITS
USE IN LARGE POLYPEPTIDE SYNTHESIS

The government has certain rights in this
invention as a result of support provided by the
Department of the Navy for the work leading to the
present invention.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of
Application Serial No. 927,253, filed November 4, 1986.

INTRODUCTION

Technical Field 

The field is related to the production of
high-molecular-weight polymers, either nucleic acids or
peptides that are the expression products of the
nucleic acids, and is particularly related to the
production of high-molecular-weight peptides containing
repeating sequences by biochemical processes, the
peptides finding use as structural materials.

Background 

Recombinant DNA technology has been applied in
the isolation of natural genes and the expression of
these genes in a variety of host cells. Typically,
this technology has had utility in producing
biologically active polypeptides, such as interferons or
peptide hormones, which were impractical to produce in
useful amounts by other means. It was also possible to
produce modified proteins by isolating natural genes
and utilizing the techniques of site specific, in vitro
mutagenesis to alter these genes and thereby change the
polypeptides produced. Other polypeptides have been
created by combining sections of native genes
to produce new polypeptides that are chimeric molecules
of the several naturally occurring molecules.

With the advent of efficient and automated
methods for the chemical synthesis of DNA, it has
become possible to synthesize entire genes and to
modify such synthetic genes at will during the course
of synthesis. However, these various technologies have
been applied to the production of natural or modified
versions of natural polypeptides. There have been very
few attempts to use these technologies to create
substantially new polypeptides. In nature, polypeptides
have a wide range of chemical, physical and
physiological characteristics. Nevertheless there are
commercial applications for which known, naturally
occurring polypeptides are not appropriate.

While biotechnology is versatile, usually it
has been limited in its applications to naturally
occurring products or modifications of naturally
occurring molecules. One great strength of organic chemical
synthesis, by contrast, has been the ability to
transform inexpensive carbon materials to a wide variety of
polymeric molecules, including naturally occurring
molecules, but most importantly entirely new chemical
structures, such as polypropylene and polyacrylates,
which have defined and predicted chemical properties
not associated with naturally occurring molecules.

Such materials, particularly high-molecular-weight
polymers containing repeating sequences of amino
acids, have proven difficult to produce by biochemical
means. The genes necessary for producing large
peptides containing repeating units of amino acids were
unstable and often underwent intermolecular
recombination causing deletions of repeating units in the
gene. The development of a biotechnology which would
produce polymeric molecules by biological processes
similar to those available by organic synthesis would
significantly broaden the range of applications of
biotechnology.



Brief Description of the Relevant Literature

The cloning of multiple lactose operators up
to four in tandem is disclosed by Sadler et al., Gene,
(1980) 8:279-300. Hybrid bacterial plasmids containing
highly repeated satellite DNA are disclosed by Brutlag
et al., Cell, (1977) 10:509-519. The synthesis of a
poly(aspartyl-phenylalanine) in bacteria is disclosed
by Doel et al., Nucleic Acids Research, (1980)
8:4575-4592. A method for enriching for proline content by
cloning a plasmid which codes for the production of a
proline polymer was disclosed by Kangas et al., Applied
and Environmental Microbiology, (1982) 43:629-635. The
biological limitations on the length of highly
repetitive DNA sequences that may be stably maintained within
plasmid replicons are discussed by Gupta et al. in
Bio/Technology, p. 602-609, September 1983.



SUMMARY OF THE INVENTION

Methods and compositions are provided for the
production of polypeptides having repetitive oligomeric
units by expression of a synthetic structural gene.
The individual units coding for an oligomeric peptide
sequence are varied as to nucleotide sequence utilizing
amino acid codon redundancy. Long nucleic acid
sequences are built up by synthesizing nucleic acid
oligomers which express a plurality of individual
repetitive peptide units, and the oligomers are joined to
provide a polynucleotide of the desired length.
Expression systems are used which provide for the growth
of the subject host to high density prior to
significant expression of the polypeptide product, followed by
induction of expression to provide high yields of the
polypeptide product, which can be isolated from the
host cells. In one embodiment, a system is employed
where the transcription initiation system of the
synthetic gene is not recognized by the host RNA
polymerase and a gene expressing a functional RNA polymerase
under inducible regulation is included in the host.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1: Plasmid pSY701 structure.

Figure 2: Immunoblots of polypeptide products
using antibody to (A) beta-lactamase or to (B) gly-ala
peptide.

Figure 3: Construction flowchart for plasmid
pG10/SlpI.

Figure 4: Immunoblots of polypeptide products
(A) T7gp10/SlpI with anti-Slp Ab, (B) T7gp9/SlpI with
anti-Slp Ab, or (C) staining with Coomassie blue.

Figure 5: Construction flowchart for plasmid
pSY856.

Figure 6: Time course for accumulation of the
kanamycin-resistance gene product with the T7 system.

Figure 7: Construction flowchart for plasmid
pSY857.

Figure 8: Construction flowchart for plasmid
pSY980.

Figure 9: (A) Amido black stain of gel
containing the product of beta-galactosidase/SlpIII
gene fusion; (B) immunoblot of same product with
anti-Slp antibody.

Figure 10: Construction flowchart for plasmid
pSY1280.

DESCRIPTION OF PREFERRED EMBODIMENTS



Novel polypeptides are provided which are
polyoligomers of repeating, relatively short, amino
acid sequence units. The oligomers may be linked by
spacers of different amino acid sequence. The novel
polypeptides therefore contain repetitive amino acid
sequences and are particularly useful as fibrous
proteins, including elastomeric. The gene encoding the
repeating-unit-containing peptides is produced to
particularly avoid problems previously associated with
genes containing multiple repeating units.

The genes of the subject invention comprise
multimers of DNA sequences encoding the same amino acid
sequence unit, where two or more different multimers
encoding different amino acid units may be joined
together to form a block copolymer. The individual
units will have from 4 to 30 amino acids (12 to 90
nt), more usually 4 to 25 amino acids (12 to 75 nt),
particularly 4 to 8 amino acids, usually having the
same amino acid appear at least twice in the same unit,
generally separated by at least one amino acid. The
units of the multimer coding for the same amino acid
sequence may involve two or more nucleotide sequences,
relying on the codon redundancy to achieve the same
amino acid sequence.
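
The reliance on codon redundancy can be illustrated with a minimal Python sketch. It assumes the standard genetic code for glycine, alanine and serine and uses the hexapeptide SGAGAG purely as an example unit; the function names are hypothetical and are not part of the disclosure.

```python
from itertools import product

# Subset of the standard genetic code covering the residues used below
# (assumption: standard code; only Gly, Ala and Ser are listed).
SYNONYMOUS_CODONS = {
    "G": ["GGT", "GGC", "GGA", "GGG"],
    "A": ["GCT", "GCC", "GCA", "GCG"],
    "S": ["TCT", "TCC", "TCA", "TCG", "AGT", "AGC"],
}
CODON_TO_AA = {
    codon: aa for aa, codons in SYNONYMOUS_CODONS.items() for codon in codons
}


def translate(dna):
    """Translate a DNA string (length divisible by 3) codon by codon."""
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


def synonymous_variants(peptide):
    """Yield every DNA sequence that encodes the given peptide."""
    choices = (SYNONYMOUS_CODONS[aa] for aa in peptide)
    for codons in product(*choices):
        yield "".join(codons)


def hamming(a, b):
    """Count the nucleotide positions at which two equal-length units differ."""
    return sum(x != y for x, y in zip(a, b))


# All distinct coding sequences for the example unit: 6 * 4**5 of them.
variants = list(synonymous_variants("SGAGAG"))
```

Any two entries of `variants` encode the same amino acid unit while differing in at least one nucleotide, which is the property the multimer design relies on.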

For the most part the DNA compositions of this
invention may be depicted by the following formula:

$$K_k\,(W\,M\,X_x\,N\,Y_y)_i\,L_l$$

wherein:

K is a DNA sequence encoding an amino acid
sequence of from about 1 to 100 amino acids, usually 1
to 50 amino acids, which may be any sequence, generally
being fewer than about 20% of the total number of amino
acids, more generally being fewer than about 10% of the
total number of amino acids, which may be any sequence,
particularly a naturally occurring sequence where the
multimer structural gene has been fused to another DNA
sequence in reading frame. K will have the initiation
methionine codon.

k is 0 or 1;

W has the formula

$$[(A)_n\,(B)_p]_q$$

wherein:

A is a DNA sequence coding each time that it
appears for the same amino acid sequence unit, normally
having at least one amino acid appear at least twice in
the sequence, where A will generally be from about 12
to 90 nucleotides (nt), more usually from about 12 to 75
nucleotides;

where there will usually be at least two
different A's, usually not more than ten different A's,
more usually not more than six different A's, which
code for the same amino acid sequence but differ from
each other by at least one nucleotide and may differ by
as many as ten nucleotides, usually not differing by
more than about five nucleotides from another A
sequence, each of the different A's usually being
repeated at least twice; at least two different codons
are employed for the same amino acid, e.g., GGC and GGA
for glycine, in different A's coding for the same amino
acid sequence unit;

n will be an integer of at least 2, usually at
least about 3, and not more than about 250, usually not
more than about 200, frequently not more than about
125, and in some instances may not exceed about 50;

B is a DNA sequence different from A coding
for an amino acid sequence other than the amino acid
sequence unit coded by the A unit and serves as a
linking unit between oligomers of A units. B will
generally have from about 3 to 45 nt (1 to 15 amino
acids), more usually from about 3 to 30 nt (1 to 10
amino acids);

where the B units appearing in the gene may be
the same or different, there usually not being more
than about 10 different B units, more usually not more
than about 5 different B units, where the B units may



WO 88/03533 PCT/US87/02822






differ in from 1 to 45 nt, more usually from about 1 to
15 nt, where the different B's may code for the same or
different amino acid sequence;

p is 0 or 1 and may differ each time there is
a successive A unit;

q is an integer of at least 1 and will vary
with the number of nucleotides in A and B, as well as
the values of n and p. The variable q will be selected
so as to provide for at least 90 nucleotides for the
multimeric portion of the structural gene, preferably
at least about 150 nt, more preferably at least 450 nt,
and most preferably at least 900 nucleotides, and the
number of nucleotides will usually not exceed about
10,000, more usually not exceeding about 8,000,
generally being in the range of about 900 to 6,000, more
usually to about 5,000; and

M is a DNA nucleotide sequence of 0 to 18 nt,
which may encode any amino acid sequence, usually
including amino acids of A and/or B, generally limited
to the amino acids of A and/or B;

X may be the same as or different from W,
usually different, and will have the formula

$$[(A')_{n'}\,(B')_{p'}]_{q'}$$

wherein:


A', B', n', p' and q' are the same as or
different from A, B, n, p and q respectively, at least
one being different, wherein the analogous symbols come
within the same definition as their counterparts;

x is 0 or 1;

N is the same as or different from M and comes
within the same definition as M;

Y may be the same as or different from W,
usually different, and will have the formula

$$[(A'')_{n''}\,(B'')_{p''}]_{q''}$$

wherein:

A'', B'', n'', p'' and q'' are the same as or
different from A, B, n, p and q respectively, at least
one being different, wherein the analogous symbols come
within the same definitions as their counterparts;

y is 0 or 1;

i is 1 to 100, usually 1 to 50, more usually 1
to 30, particularly 1, when x and y are 0;

when x or y are 1, q, q' and q'' will be a
total of at least 2, usually at least 5 and not more
than about 50, usually not more than about 30.

The total number of nucleotides will be at
least 90 nucleotides, usually at least about 150 nt,
preferably at least about 900 nt and may be 20 knt
(kilonucleotides), usually not more than about 15 knt, more
usually not more than about 10 knt.
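
The length limits above follow from simple arithmetic on the repeat parameters. The sketch below is illustrative only: it assumes that each of the q blocks consists of n copies of A followed by p copies of B, with p held constant (the text allows p to vary between blocks), and the function names are hypothetical.

```python
def multimer_length_nt(len_a_nt, n, len_b_nt, p, q):
    """Nucleotide length of q blocks, each n copies of A plus p copies of B."""
    if len_a_nt % 3 or len_b_nt % 3:
        raise ValueError("coding units must be whole numbers of codons")
    return q * (n * len_a_nt + p * len_b_nt)


def min_q_for_length(len_a_nt, n, len_b_nt, p, target_nt):
    """Smallest q whose multimeric portion reaches target_nt nucleotides."""
    block = n * len_a_nt + p * len_b_nt
    return -(-target_nt // block)  # ceiling division


# Example values (assumed for illustration): an 18-nt A unit repeated
# 9 times, followed by one 18-nt B unit, gives a 180-nt block.
length = multimer_length_nt(18, 9, 18, 1, 5)
amino_acids = length // 3  # 3 nucleotides per codon
```

With these example values, five blocks reach the 900-nucleotide figure given as most preferred, corresponding to 300 amino acids.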

The polypeptide encoded by the above DNA
sequence will have the following formula:

$$K'_k\,(W'\,M'\,X'_x\,N'\,Y'_y)_i\,L'_l$$

wherein:

W' will have the following formula

$$[(D)_n\,(E)_p]_q$$

wherein:

D is the amino acid sequence encoded for by A
and therefore has the numerical limitations based on 3
nucleotides defining a codon that codes for one amino
acid;

E is the amino acid sequence encoded for by B,
and therefore has the numerical limitations based on 3
nucleotides defining a codon, where each E may be the
same or different, depending upon the coding of B;
and, wherein, likewise K', W', M', X', N', Y'
and L' is the amino acid sequence encoded for by K, W,
M, X, N, Y and L respectively. However, in the case of
K and L, subsequent processing, such as protease
treatment, cyanogen bromide treatment, etc. may result in
partial or complete removal of the N- or C-terminal
non-multimeric chains.

n, p, q, k, i and l have the same definitions
as previously indicated.

Particular polymeric compositions having
repeating multimeric units having the same compositions
(A) will have the following formula where x and y
are 0,

$$K'_k\,[(D)_n\,(E)_p]_q\,L'_l$$

where all of the symbols have been defined
previously; and

the DNA sequence will have the formula

$$K_k\,[(A)_n\,(B)_p]_q\,L_l$$

where all of the symbols have been defined
previously.

Particular DNA sequences encoding copolymeric
compositions having a repeating unit of two to three
multimeric blocks will have the following formula:

$$K_k\,(W''\,M''\,X''_x\,N''\,Y''_y)_i\,L_l$$

wherein:

W'' is a multimer having the formula

$$[(A^3)_{n^3}\,(B^3)_{p^3}]_{q^3}$$

where A^3 is of 4 to 8, usually 4 to 6 codons,
otherwise coming within the definition of A;

n^3 will be from about 2 to 12, usually 2 to 10;

B^3 is of from 2 to 8, usually 4 to 6 codons;

p^3 is 0 or 1;

q^3 is of from about 2 to 25, usually 2 to 20;

X'' and Y'' are the same as or different from
W'', usually different, coming within the same
definitions as W'';

M'' and N'' come within the definitions of M'
and N';

i is at least 2, usually at least 5 and not
more than about 75, usually not more than about 50,
generally not exceeding 30;

with the other symbols as defined previously.

The compositions of the invention will usually
have a molecular weight of at least about 5 kDal,
usually 10 kDal, preferably 15 kDal, and may have molecular
weights as high as or higher than 400 kDal, usually not
exceeding 300 kDal, more usually not exceeding about
250 kDal, the higher ranges generally being the multimer
combinations, with the individual multimer usually
being less than about 150 kDal, usually less than about
100 kDal.

The nucleotide sequences which are employed
will be synthesized, so that the repetitive units will
have different codons for the same amino acid as
described above. Usually, at least about 25%, more
usually at least about 40%, and generally at least
about 50%, but not greater than about 95%, preferably
not greater than about 90% of the nucleotide sequences
encoding the repetitive units will be the same.
Greater diversity within those ranges will be employed
where the initial constructs are experimentally shown
to undergo spontaneous recombination events.
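
One way to check a design against these percentages is to compare the coding sequences of two repeat units position by position. The following is a minimal sketch under the assumption that "the same" is read as pairwise nucleotide identity between two equal-length units; the two example sequences are hypothetical encodings of SGAGAG chosen for illustration, and the range limits are passed in rather than fixed.

```python
def percent_identity(unit_a, unit_b):
    """Percentage of positions at which two equal-length DNA units match."""
    if len(unit_a) != len(unit_b):
        raise ValueError("units must have the same length")
    matches = sum(x == y for x, y in zip(unit_a, unit_b))
    return 100.0 * matches / len(unit_a)


def within_range(identity, low=25.0, high=95.0):
    """True when the identity lies inside the stated window (assumed bounds)."""
    return low <= identity <= high


# Two hypothetical encodings of the same hexapeptide (Ser-Gly-Ala-Gly-Ala-Gly).
unit_1 = "TCTGGTGCTGGTGCTGGT"
unit_2 = "AGCGGAGCAGGCGCTGGT"
identity = percent_identity(unit_1, unit_2)
```

Lowering the identity between units, as the text suggests for constructs that recombine spontaneously, amounts to choosing alternative codons at more positions.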

Of particular interest are polypeptides which
have as a repeating unit SGAGAG (G = glycine; A =
alanine; S = serine). This repeating unit is found in
a naturally occurring silk fibroin protein, which can be
represented as GAGAG(SGAGAG)8SGAAGY (Y = tyrosine). In
the subject invention, the repeating unit is designed
where the N-terminus may be MGAGAG or any other
sequence of generally at least about 3 amino acids,
usually at least about 5 amino acids, more usually 12
amino acids and not greater than 200, usually not
greater than 100 amino acids, which may be different
from the repetitive unit. Generally, a different
N-terminus will be the result of insertion of the gene
into a vector in a manner that results in expression of
a fusion protein. Any protein which does not interfere
with the desired properties of the product may provide
the N-terminus. Particularly, endogenous host
proteins, e.g. bacterial proteins, may be employed. The
choice of protein may depend on the nature of the
transcriptional initiation region. Similarly, the
C-terminus may have an amino acid sequence different from
the repeat sequence. Conveniently, there may be from 1
to 100, usually 1 to 25 amino acids, which may be the
C-terminus of a naturally occurring structural gene,
which again typically results from the formation of a
fusion product.

A silk-like-protein (Slp) gene may be produced
by providing oligomers of from about 5 to 25 repeat
units as described above, more usually of about 10 to
20 repeat units. By having different cohesive ends,
the oligomers may be concatemerized to provide for the
polymer having 2 or more of the oligomeric units,
usually not more than about 50 oligomeric units, more
usually not more than about 30 oligomeric units, and
frequently not more than about 25 oligomeric units.

The silk-like proteins may be varied by having
alternate multimers with the same or different
handedness. For example, in the formula, (B)p may provide an
even or odd number of amino acids. In silk, the
hydrogens of the glycine may align on one side and the
methyls and hydroxyls of alanine and serine on the
other. If (B)p is even, there will be continuous
alignment; if odd, there will be alternating alignment
of (A)n. Thus, different properties can be achieved by
changing the number of amino acids encoded by (B)p.

Of particular interest are polypeptides which
mimic the composition and physical properties of silk
of Bombyx mori.

Also of interest are polypeptides which have
as a base repeating unit GVGVP (G = glycine,
V = valine, P = proline), which may be found in
naturally occurring elastin. In the subject invention, the
N-terminus may be any convenient sequence and, if
desired, may be in whole or in part removed by a
protease. Usually the N-terminal sequence which does not
have the subject motif will be less than about 100
amino acids, more usually less than about 60 amino
acids.

Of particular interest is a base sequence of
about 6 to 10, preferably 8, units separated by a
sequence of about 6 to 20 amino acids, usually 8 to 15
amino acids, which may include an internal repeat
different from the basic repeating unit of from 4 to 8
amino acids. For example, the second repeat sequence
could be GAGAGS, repeated twice. The total number of
base repeating units will generally be in the range of
about 150 to 300, usually 175 to 250. The C-terminus
may terminate with a repetitive unit or portion thereof
or a different sequence of from 1 to 100, usually 1 to
30 amino acids. The C-terminus is not critical to the
invention and will be selected primarily for
convenience. As with the N-terminus, it may be designed for
proteolytic cleavage. As in the case of the silk
protein, the subject elastin-like protein may be similarly
engineered.

Of particular interest are proteins which
mimic the properties of elastin and provide for
elastomeric properties.

The copolymer involving repeating units is a
powerful method for varying properties, by appropriate
choice of the different units, the number of units in
each multimer, the spacing between them, and the number
of repeats of the multimer combination assembly. Thus,
by varying the number and arrangement of primary
monomers, a variety of different physical and chemical
properties can be achieved.

Exemplary of the use of the block copolymers
are combinations of silk units and elastin units to
provide products having properties distinctive from
polymers only having the same monomeric unit.

To prepare the structural genes, various
approaches can be employed. To prepare the oligomers,
complementary strands of DNA may be synthesized, so
that upon hybridization double-stranded DNA is obtained
with the appropriate termini. If desired, each of the
oligomeric units may be the same, so that in a single
step, a concatemer may be obtained depending upon the
conditions of the hybridization. Normally,
conventional annealing and ligating conditions will be
employed, such as are described in the examples that
follow.

If desired, two different oligomeric units may
be prepared where the termini of the two units are
complementary one with the other but the termini of the
same unit are unable to bind together. In this way one
can build individual oligomeric units and then join
them together to form the concatemer, where the
intervening linking sequences are defined at least in part
by the termini. Depending upon the construct, the 5'
terminus may provide for the initiation codon
methionine, or the structural gene may be joined to an
adapter which may provide for a unique sequence (optionally
cleavable by a specific enzyme) at the 5' terminus or
may be inserted into a portion of gene, usually
endogenous to the host, in proper reading frame so as to
provide for a fusion product. By providing for
appropriate complementary termini between the adapter or
truncated gene and the 5' end of the subject structural
gene, the sequences can be joined in proper reading
frame to provide for the desired protein. Advantages
that may be achieved by employing adapters or fusion
proteins include having specific sequences for special
purposes, such as linking, secretion, complex formation
with other proteins, affinity purification, or the
like.
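
The requirement that the termini of two units pair with each other but not with themselves can be expressed as a short check on the single-stranded ends. This is a simplified sketch, not the procedure of the examples: it assumes each unit is described only by its two overhangs, both written 5' to 3', and the overhang sequences and names are hypothetical.

```python
from typing import NamedTuple

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


class Unit(NamedTuple):
    name: str
    left_overhang: str   # single-stranded end, written 5' to 3'
    right_overhang: str  # single-stranded end, written 5' to 3'


def can_anneal(end_a, end_b):
    """Two overhangs pair when one is the reverse complement of the other."""
    return end_a == reverse_complement(end_b)


def self_ligates(unit):
    """True if any pair of this unit's own termini could bind together."""
    ends = (unit.left_overhang, unit.right_overhang)
    return any(can_anneal(a, b) for a in ends for b in ends)


def joins(first, second):
    """True if the right end of `first` pairs with the left end of `second`."""
    return can_anneal(first.right_overhang, second.left_overhang)


# Hypothetical, non-palindromic overhangs chosen so that the units alternate.
unit_1 = Unit("unit 1", "GGTC", "CTGA")
unit_2 = Unit("unit 2", "TCAG", "GACC")
```

Palindromic overhangs such as AATT would fail the `self_ligates` check, which is why non-palindromic ends are used in this illustration.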

Once the structural gene has been assembled,
it may be cloned; clones having the desired gene,
particularly as to sequence length, may be isolated;
and the gene may be removed and used to join to a
sequence for expression.

The expression construct will include
transcriptional and translational initiation and termination
regulatory regions, 5' and 3', respectively, of the
structural gene. As already indicated, these regions
may be created by employing a fusion protein, where the
subject structural gene is inserted into a different
structural gene downstream from its initiation codon
and in reading frame with the initiation codon.
Alternatively, various transcriptional and translational
initiation regions are available from a wide variety of
genes for use in expression hosts, so that these
transcriptional and translational initiation regions may be
joined to the subject structural gene to provide for
transcription and translation initiation of the subject
structural genes. A wide variety of termination
regions are available which may be from the same gene
as the transcriptional initiation region or from a
different gene. Numerous constructs have been
disclosed in the literature, and the same procedures may
be applied with the subject gene as have been employed
with other structural genes.

Of particular interest is the use of an
inducible transcription initiation region. In this
manner, the host strain may be grown to high density
prior to significant expression of the desired product.
Providing for inducible transcription is particularly
useful where the peptide is retained in the cellular
host rather than secreted by the host.

A number of inducible transcription initiation
regions exist or can be employed in particular
situations. The inducible regions may be controlled by a
particular chemical, such as isopropyl thiogalactoside
(IPTG) for inducing the beta-galactosidase gene. Other
inducible regions include lambda left and right
promoters; various amino acid polycistrons, e.g.,
histidine and tryptophan; temperature sensitive promoters;
and regulatory genes, e.g., cIts 857.

An alternative system which may be employed
with advantage is use of a combination of transcription
initiation regions. A first transcription initiation
region which regulates the expression of the desired
gene but which is not functional in the expression host
by failing to be functional with the endogenous RNA
polymerase is employed. A second transcription
initiation region, such as an inducible region, can then be
employed to regulate the expression of an RNA
polymerase with which the first transcription initiation
region is functional. In this manner expression only
occurs upon activation of the regulatory region
controlling the expression of the exogenous RNA
polymerase. In the subject application, this system is
illustrated with the T7 phage transcription initiation
region, specifically the initiation regions of genes 9
and 10 of T7 phage.

An alternative system relies on the use of
mutants which undergo a developmental change based on a
change in the environment, such as a lack of a
nutrient, temperature, osmotic pressure, salinity, or the
like. Illustrative of this system, strains of B.
subtilis can be obtained which are incapable of
sporulation but which can produce those components which
initiate expression of products involved with
sporulation. Therefore, by a change in the condition of the
medium, a transcription initiation region associated
with sporulation will be activated. In this situation,
the host provides the necessary inducing agent or
activator to initiate expression.

Various other techniques exist for providing
for inducible regulation of transcription and
translation of a gene in a particular host.

For the most part, the host will be a
unicellular organism, either a prokaryote or a eukaryote,
selected from bacteria, algae, fungi, filamentous
fungi, etc. Illustrative hosts include E. coli, B.
subtilis, B. stearothermophilus, S. cerevisiae, and the
like.

The expression construct for expression of the
desired gene, by itself or in conjunction with any
auxiliary genes involved with transcription, will
normally be joined to an appropriate vector for
introduction into the expression host. A large number of
vectors are commercially available with others being
described in the literature. The vectors are normally
characterized by having one or more unique restriction
sites, a replication system for extrachromosomal
maintenance in the host, and one or more markers which
allow for selective pressure on the host. The markers
may provide complementation, prototrophy to an
auxotrophic host, resistance to a biocide, e.g., an
antibiotic such as penicillin or kanamycin, or the like.
In some instances, rather than selective pressure, a
marker gene is employed which allows for detection of
particular colonies containing the gene. This
situation is illustrated by the gene for beta-galactosidase,
where a substrate is employed which provides for a
colored product.

The expression construct, including any
auxiliary genes, may be introduced into the expression
vector in accordance with known techniques,
particularly employing restriction, insertion, and ligation.

The expression construct may then be used for
transformation of the appropriate host. Depending upon
the host, either intact cells or protoplast may be
employed, where transformation or conjugation is
employed. Conveniently, calcium-phosphate-precipitated
DNA or non-ionic detergents may be employed to
introduce the plasmid into the host. It should be
appreciated that it is not necessary to employ vectors for
host transformation, since bare DNA can be introduced
for integration into the genome. However, even where
integration is desired, a much greater efficiency of
integration is achieved employing the vectors, thus
favoring the employment of vectors.

Depending upon the nature of the vector, the
expression construct may be maintained on an
extrachromosomal element or become integrated into the host.
Where integration is desired, it will usually be
desirable with prokaryotes and some eukaryotes to have
a sequence homologous to a sequence in the chromosome
of the host. Usually the sequence will be at least
about 200 bp and not more than about 5000 bp, usually
not more than about 2000 bp. The choice of the
homologous sequence is somewhat arbitrary, but may be used
for complementation, where the host is an auxotrophic
mutant and the homology provides prototrophy.

The transformants or integrants may then be
grown in an appropriate nutrient medium to high
density, followed by induction of transcription in
accordance with the nature of the transcriptional system of
the expression construct. Where the desired protein is
retained in the cytoplasm, these cells are harvested
and lysed, and, depending upon the use of the protein,
the protein may be further purified in accordance with
conventional techniques, such as chromatography,
solvent-solvent extraction, affinity chromatography,
and the like.

The repetitive proteins can find a variety of
uses. The Slp proteins may be used in producing fibers
having unique properties, as a substitute for silk, and
the like. Collagen proteins can be produced, where the
collagen is free of the telopeptide or contains the
telopeptide, depending upon its function.
Atelopeptidecollagen should have little if any immunogenicity,
so as to be a useful structural element for a variety
of prosthetic devices or for use as a collagen
substitute in other applications. Similarly, other proteins
having repetitive sequences, such as keratin, can also
be prepared in accordance with the subject invention.
Other useful repetitive proteins can be prepared based
on sequences of spider silks and other repetitive
animal fibers. Artificial peptides useful for
immunization could also be prepared based on repeating
sequences present in various surface antigens of
disease-causing microorganisms, such as parasites,
bacteria, and viruses.

The following examples are offered by way of
illustration and not by way of limitation.

DNA Preparation Methods

1. Preparation of plasmid DNA from E. coli:

A. Small scale: Plasmid DNA was prepared
from 1.5 ml cultures by either the boiling procedure or
the alkaline lysis method (Maniatis et al., Molecular
Cloning: A Laboratory Manual. Cold Spring Harbor
Laboratory, Cold Spring Harbor. (1982)).
35 3. Large scale: A plas-ii d-carrying strain 

was grown overnight in ' liter of Luria broth with ohe 
appropriate anoibiooio. The cells were collecoad by 
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cer:::rirusa"ion ^C,GCOxg for 5 :nin and resuspencec i 
^0 rnl cf ice ccld TH: (TO-M Tria-HCl pH 3, 1 :nM ZDTA). 
The cells were c en r 1 f u g e c again, res'^ccencec in - -1 
or TiS (TH: and v/v sucrose) and hcfncgen i zed c/ 

vcr-exing. Tne sarnples were kepo on ice for zne fol- 
lowing steps. Lysos/rTie (i nl of 10 rng/-il ) was added to 
the cell suspension and incucaoad for 5 T.in before tne 
addition of 2 :nl of 0.5M ZDTA pH 3. After 10 :nin incu- 
dation, 50 ml of proteinase K (-0 mg/ml) were added 
followed 10 jiin later with ^5 :nl of lysing duffer (O.'J 
Triton X-'OO, 1 "M ElOTA, oOn^M tris-HCl pH 3). After 15- 
20 :T:in, the cell lysate was centrifu ged at 35,0 00 xg for 
90-120 min. The supernatant (19.3 -1; was transferred 
to a plastic tube with 20 g cf CsCl and ^00 ul of 
ethidiun: brooiide ('0 :ng/:Til). After dissolution, the 
.fixture was divided into two polyallomer ultracentri- 
fuge tubes, sealed with heat and centrifuged in a 
Seck-an Tl 65 rnctor at cG,000 rpm for 2^ nr. The lower 
plasmld DMA band was removed from the tube with a hypc- 
derrnic needle. The ethidium bromide was extracted 
three times with an equal volume of NaC l-s at ura t e a 
isopropanol. Two volumes of HpO were added to the DMA 
solution, and then the DMA was precipitated with 
et hand . 



2. Preparation of double-stranded DNA:

A culture of JM103 was grown to an OD600 of
about 0.2 and then divided into aliquots of 2 ml. Each
aliquot was infected with a fresh plaque of M13 and
incubated at 37°C for about 6 hr with vigorous shaking.
Then the cells were pelleted and the supernatant was
saved for subsequent infections. The double-stranded
phage DNA was extracted by the boiling method (Maniatis
et al.).

Phenol extraction was performed on a convenient
volume of DNA sample, typically between 100 ul to
10 ml. The DNA sample was diluted in 0.01M Tris-HCl pH
7.5, 1 mM EDTA and an equal volume of water-saturated
phenol was added. The sample was vortexed briefly and
placed on ice for 3 min. After centrifugation for 3
min in a microfuge, the aqueous layer was removed to a
new tube and extracted once with an equal volume of
chloroform:isoamylalcohol (24:1).

4. Ethanol precipitation:

DNA in an aqueous buffer was concentrated by
ethanol precipitation. To the DNA sample was added
1/10 volume of 3M sodium acetate pH 7.5 and 2-3 volumes
of cold ethanol. The DNA was precipitated for 30 min
at -70°C or overnight at -20°C and then pelleted by
centrifugation in the microfuge for 15 min at 4°C. The
pellet was washed once with 200 ul of cold 80% ethanol
and pelleted again for 10 min at 4°C. After air drying
or lyophilization, the pellets were resuspended in the
appropriate buffer.
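
The reagent volumes in the ethanol precipitation step scale with the sample volume, so they can be sketched as a small helper. This is only an illustrative calculation: the function name is hypothetical, and it assumes that both the 1/10 volume of sodium acetate and the 2-3 volumes of ethanol are taken relative to the starting sample volume, which the text does not state explicitly.

```python
def ethanol_precipitation_volumes(sample_ul, ethanol_volumes=2.5):
    """Return (sodium_acetate_ul, ethanol_ul) for a DNA sample.

    Assumption: both additions are computed from the starting sample
    volume. ethanol_volumes must lie in the 2-3 range given in the text.
    """
    if sample_ul <= 0:
        raise ValueError("sample volume must be positive")
    if not 2.0 <= ethanol_volumes <= 3.0:
        raise ValueError("ethanol volumes should be between 2 and 3")
    sodium_acetate_ul = sample_ul / 10.0        # 1/10 volume of 3M sodium acetate
    ethanol_ul = ethanol_volumes * sample_ul    # 2-3 volumes of cold ethanol
    return sodium_acetate_ul, ethanol_ul


if __name__ == "__main__":
    acetate, ethanol = ethanol_precipitation_volumes(100.0)
    print(f"sodium acetate: {acetate} ul, ethanol: {ethanol} ul")
```

For a 100 ul sample and the default of 2.5 volumes, the helper gives 10 ul of sodium acetate and 250 ul of ethanol.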

5. Phosphatase treatment of DNA:

Phosphatase treatment of DNA was performed by
adding 1 ul (25 units) of calf intestinal phosphatase
(Boehringer Mannheim) directly to the restriction enzyme
digestion reaction and continuing the incubation for 30
min at 37°C. The phosphatase was inactivated for 50
min at 65°C prior to deproteinization by phenol extraction.



6. Fill-in reaction with DNA polymerase I:

DNA was resuspended in buffer containing 50mM
Tris-HCl pH 7.4, 50mM KCl, 5mM MgCl2, and 400 uM each
of the four deoxynucleotide triphosphates. Ten units
of Klenow DNA polymerase (BRL) were added, and the
reaction was allowed to proceed for 15 min at room
temperature. The DNA was then phenol extracted and
ethanol precipitated.

7. Polynucleotide kinase reaction:

The reaction (10 ul) contained: polynucleotide
kinase (BRL), 150 ng of DNA, 1 ul of 10x
kinase buffer (0.7M Tris-HCl pH 7.6, 0.1M MgCl2, 50mM
DTT) and [32P]-ATP (200-300 nCi). This was incubated
at 37°C for 30 min and then the DNA was purified using
a NACS column (Bethesda Research Labs).

8. Digestion with restriction endonucleases:

DNA was digested with restriction endonucleases
(REN) in 1 x "AA" buffer [10 x AA buffer is 330
mM Tris-acetate, pH 7.9, 660mM potassium acetate, 100mM
magnesium acetate, 50mM dithiothreitol (DTT) and 1
mg/ml bovine serum albumin (nuclease free)]. Whenever
possible, the concentration of DNA was kept below 1
ug/25 ul. Incubation was at 37°C for 1-4 hrs for most
restriction endonucleases except for BalI, BanI and
NaeI digestions which were incubated overnight.
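
The two quantitative rules in the digestion step (a 10 x buffer stock used at 1 x, and DNA kept below 1 ug per 25 ul) can be expressed as a short calculation. The sketch below is illustrative only; the function names are hypothetical and are not taken from the specification.

```python
def stock_volume_ul(final_volume_ul, stock_fold=10.0):
    """Volume of a concentrated buffer stock needed to reach 1 x."""
    if final_volume_ul <= 0 or stock_fold <= 0:
        raise ValueError("volumes and fold must be positive")
    return final_volume_ul / stock_fold


def max_dna_ug(final_volume_ul, limit_ug=1.0, per_ul=25.0):
    """Largest DNA mass that keeps the digest at or below 1 ug per 25 ul."""
    if final_volume_ul <= 0:
        raise ValueError("volume must be positive")
    return limit_ug * final_volume_ul / per_ul


if __name__ == "__main__":
    volume = 50.0  # hypothetical reaction volume in ul
    print(f"10 x AA buffer: {stock_volume_ul(volume)} ul")
    print(f"DNA ceiling:    {max_dna_ug(volume)} ug")
```

For a hypothetical 50 ul digest this gives 5 ul of the 10 x stock and a ceiling of 2 ug of DNA.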

9. Analytical agarose gel electrophoresis of DNA:

To DNA samples for gel analysis we added 0.2
volumes of loading buffer (5 x electrophoresis buffer,
0.01% bromphenol blue dye, 50mM EDTA, and 50%
glycerol). Then the samples were loaded into lanes of
a horizontal submerged electrophoresis unit containing
a 1.0% (w/v) agarose gel. The electrophoresis buffer
was either 1 x TAC or 1/2 x TBE. The 1 x TAC is 40mM
Tris-base, 10mM EDTA, adjusted to pH 7.8 with acetic
acid. The 1/2 x TBE is 0.045M Tris-base, 0.045M boric
acid, 1mM EDTA, pH 8. The gel was run at 40-50V for 18
hr, then removed and stained with 0.5 ug/ml ethidium
bromide for 30 min. The DNA bands were visualized on a
long wavelength UV transilluminator.

10. Preparative agarose gel electrophoresis:

The procedures and materials are the same as
for the analytical agarose gel electrophoresis. The
only difference is the use of low melting point
agarose, ranging in concentration from 0.5 to 1.5%
(w/v) depending on the size of the DNA fragment to be
purified. DNA restriction fragments were excised from
the LMP agarose gels after visualization with ethidium
bromide.

11. NACS purification:

Gel fragments containing DNA were melted at
70°C for 5 min and diluted approximately 5 fold with
TE1 (10mM Tris-HCl pH 7.5, 0.2M NaCl). The gel solution
was applied to a NACS column (BRL). The column
was washed with 5 ml of the same buffer. The bound DNA
was eluted with 300 ul of either TE2 (10mM Tris-HCl pH
7.5, 1.0M NaCl) for DNA fragments smaller than 1000 bp
or TE3 (10mM Tris-HCl pH 7.5, 2M NaCl) for larger fragments.
The eluted DNA was concentrated by ethanol
precipitation.

12. DNA ligation:

Reactions for ligating cohesive ends contained:
1 ug DNA, 1 x AA buffer (see step 8, above),
1mM ATP and 20 units of T4 DNA ligase (BRL) in a 20 ul
final reaction volume. The ligation was allowed to
proceed for 15-18 hr at 15°C or 1-2 hr at room temperature.
For blunt-ended ligations the reactions contained
1 ug DNA, 25mM Tris-HCl pH 7.5, 5mM MgCl2, 5mM
DTT, 0.25mM spermidine, 200mg BSA, 1 mM hexamine cobalt
chloride (HCC), 0.5mM ATP and 400 units T4 DNA ligase
(NEB) in a 20 ul reaction volume. The ligation was allowed
to proceed for 30 min to 1 hr at room
temperature.

Bacterial Transformation Methods

1. Preparation of transformation-competent E. coli
cells:

A culture of 200 ml of sterile L broth was
inoculated with a small loopful of E. coli cells. This
was incubated with shaking at 37°C until the OD600 was
approximately 0.5. The culture was placed on ice for
10 min and centrifuged at 5,000xg for 10 min. The cell
pellet was resuspended in 100 ml of ice-cold 0.1M
MgCl2, kept on ice for 30-40 min and centrifuged again.
The pellet was resuspended in 2 ml of ice-cold 100mM
CaCl2, transferred to a sterile test tube and incubated
on ice for 24 hr. The competent cells were then aliquoted
and stored at -70°C.

2. Transformation of E. coli:

An aliquot of frozen competent cells was
thawed on ice. To 50 ul of cells 0.1 to 1 ug of DNA
was added and the mixture was incubated on ice for 30
min. The tube was removed from ice and placed in a
42°C bath for 2 min. L broth (1 ml) was added and the
transformation mix incubated with shaking at the
desired temperature (usually 30°C or 37°C) for 2 hr.
Then one-tenth of the transformation was plated on L
broth plates containing the appropriate antibiotic and,
when necessary, XGAL and IPTG were added.

3. DNA transformation of B. subtilis:

B. subtilis cells were grown to early stationary
phase (change in Klett units of ≤5% in 15
min.). Transformation followed established procedures
(Anagnostopoulos et al., 1981) (ref. 3). Cells (0.45
ml) were incubated with 1-10 ug of DNA at 37°C for 30
min with shaking, and then plated on TBAB agar plates
with an appropriate antibiotic.

4. Isolation of plasmid DNA from B. subtilis:

Plasmid DNA from B. subtilis was obtained by a
method similar to the alkaline-lysis method except that
pelleted cells were resuspended in 5 ml of solution 1
(50mM glucose, 10mM EDTA, 25mM Tris-HCl (pH 8.0), 10
mg/ml lysozyme) and incubated at room temperature for
30 min. Then 15 ml of solution 2 (0.2N NaOH, 1% (w/v)
SDS) was added and incubated on ice for 10 min.
Finally, 12 ml of 3M potassium acetate (pH 4.8) was
added and incubated an additional 20 min on ice. The
lysed cells were centrifuged 15 min at 15,000 rpm in a
Sorval SS-34 rotor. The DNA was precipitated by adding
an equal volume of isopropyl alcohol and centrifuged at
7,000 rpm. The pellet was resuspended in 5 ml of 10mM
Tris-HCl (pH 7.5), 1 mM EDTA (TE). The solution was
phenol extracted once and chloroform extracted. DNA
was precipitated with ethanol and resuspended in 3 ml
of TE. The volume was adjusted to 5.2 ml by adding
4.2g CsCl, 400 ul of ethidium bromide at 10 mg/ml and
TE. The solution was transferred to a Beckman quick-seal
polyallomer centrifuge tube and centrifuged at
45,000 rpm in a Beckman VTi65 rotor for 18 hr.



Antibody Production, Protein Chemistry and
Electrophoresis of Proteins

1. Preparation of antibody to artificially synthesized
peptides:

Synthetic peptide of sequence (GAGAGS)8GAAGY
was coupled to BSA using the glutaraldehyde procedure
of Kagen and Glick (1979). The degree of coupling was
monitored using trace amounts of radioactive iodinated
synthetic peptide. Peptide conjugates at a
concentration of 1mg/ml in complete Freund's adjuvant
were used to immunize rabbits at day 0. Animals were
re-injected with antigen in Freund's incomplete
adjuvant at day 30 and titered at day 50. Positive
sera was detected using a microtiter RIA using the
synthetic peptide as antigen. Kagen and Glick (1979),
in Methods of Radioimmunoassay, Jaffe and Berman
(eds.), Academic Press, p 328.

A peptide of 53 amino acids corresponding to
the SlpIII sequence was prepared on an Applied Biosystems
peptide synthesizer. The yield of this material,
which has a molecular weight of 3540, was approximately
0.5 grams. The peptide was coupled to bovine
serum albumin. The material was sent to Antibodies,
Inc. for preparation of antibodies in rabbits. Antisera
was obtained that reacted with synthetic peptides
of both the SlpI and SlpIII sequences. These antisera
have been useful for the detection of fusion peptides
containing gly-ala sequences.

Following the procedure described above an
antigen was synthesized having the formula
(V-P-G-V-G)8, which was coupled to keyhole limpet
hemocyanin. Polyclonal antisera was then prepared as
described above which bound to the ELP peptide.

2. Polyacrylamide gel electrophoresis of proteins:

Approximately 10^9 E. coli cells from growing
cultures were pelleted by centrifugation at 10,000xg
for 5 min. The cell pellets were resuspended in 100 to
500 ul of 2X sample buffer (100mM Tris-HCl pH 6.8, 4%
SDS, 10% β-mercaptoethanol, 60% glycerol or sucrose)
and sonicated for 30 sec using a Tekmar sonic disrupter.
Samples were boiled for approximately 5 min
and 20 to 100 ul of the cell lysates were loaded on an
SDS-polyacrylamide gel (7.5 to 16% w/v). The gels were
prepared following the procedure of Laemmli (Nature,
227:680-685 (1970)). The proteins in the gels were
stained with 2% Coomassie brilliant blue in 10%
methanol, 7.5% acetic acid for 1 hr and destained in
10% methanol, 7.5% acetic acid overnight.

3. Immunoblotting of proteins in gels:

After protein electrophoresis, one of the
flanking glass plates was removed from the polyacrylamide
gel. The gel surface was wetted with transfer
buffer (25mM Tris-HCl, 192mM glycine, 20% methanol). A
piece of nitrocellulose paper (Sartorius, SM11307) was
saturated with transfer buffer and laid on the gel.
Air bubbles between the filter and the gel were removed.
The gel and nitrocellulose filter were placed
in the transfer unit as specified by manufacturer (Bio-Rad).
Transfer was allowed to proceed at 200 mA for
3-4 hr. Then the nitrocellulose filter was removed and
stained with Amido-Schwartz for 3 min (0.05% Amido
black, 45% deionized H2O, 45% methanol, 10% acetic
acid) and destained in H2O. The filter was incubated
for at least 10 min at room temperature in "BLOTTO" (5%
w/v nonfat dry milk, 50mM Tris-HCl pH 7.4, 0.9% w/v
NaCl, 0.2% w/v sodium azide). The filter was placed in
serum appropriately diluted (1:50 to 1:500) in 0.5X
Blotto (2.5% nonfat dry milk, 50mM Tris-HCl pH 7.4,
0.9% NaCl, 0.2% sodium azide) and was gently agitated
for approximately 15 hr at room temperature. The
filter was washed for 1 hr with 5 changes of TSA (50mM
Tris-HCl pH 7.4, 0.9% NaCl, 0.2% sodium azide). The
blot was placed in 15 ml of 0.5X BLOTTO solution containing
1x10^7 cpm of the 125I-protein A and gently
agitated for 2 hr at room temperature. The filter was
washed for 2 hr with a minimum of 7 changes of TSA,
rinsed once with deionized H2O and air dried. The blot
was covered with Saran wrap and autoradiographed.



4. Amino Acid Analysis:

Amino acid compositions are determined by the
PTC derivatization procedure of Henrickson and Meredith
(1984). Protein samples were hydrolysed with 5.7 M
constant boiling HCl at 108°C for 24 hours in vacuo.
After reaction with PITC, amino acid derivatives were
detected at 254 nm by HPLC reverse phase chromatography
using a Hewlett Packard 1090 system and a Supelco C18
column (4.6 mm x 25 cm) with a linear gradient of 0-50%
acetonitrile in 0.1M NH4OAc pH 6.78 as a mobile phase.
Henrickson, R.L. and Meredith, S.C. (1984) Amino
Analysis by Reverse Phase High Performance Liquid
Chromatography. Anal. Biochem. 136:65-74.

5. Amino Acid Sequence Analysis:

The N-terminal amino acid sequence was
determined by automated Edman degradation using an
Applied Biosystems Model 470A gas phase protein
sequenator. The PTH amino acid derivatives were
analyzed by reverse phase HPLC using a Hewlett Packard
1090 system and an Altex C18 column (2 mm x 25 cm) with
a complex gradient buffer system.

6. Peptide Synthesis:

Synthetic peptides were prepared by solid
phase synthesis on an Applied Biosystems Model 430A
Peptide Synthesizer using the standard symmetric
anhydride chemistry as provided by the manufacturer.
The coupling yield at each step was determined by the
quantitative ninhydrin procedure of Sarin et al.
(1981). The synthetic peptide was cleaved from the
solid support and amino acid blocking groups were
removed using anhydrous HF (Stewart and Young, 1984).
Crude peptides were desalted by chromatography over
Sephadex G-50. Sarin, V.K., Kent, S.B.H., Tam, J.P.
and Merrifield, R.B. (1981). Anal. Biochem. 237:927-936.
Stewart, J.M. and Young, J.D. (1984). Solid
Phase Peptide Synthesis, Pierce Chemical Company,
Rockford, IL. pp 85-89.

1. In vitro DNA synthesis:

The N,N-diisopropylphosphoramidites, controlled-pore
glass columns and all synthesis reagents
were obtained from Applied Biosystems, Foster City,
California.

Synthetic oligonucleotides were prepared by
the phosphite triester method with an Applied
Biosystems Model 380A DNA synthesizer using a 10-fold
excess of protected phosphoramidites and 1 umole of
nucleotide bound to the synthesis support column. The
chemistries used for synthesis are the standard protocols
recommended for use with the synthesizer and have
been described (Matteucci, et al., Journal Amer. Chem.
Soc., 103:3185-3319 (1981)). Deprotection and cleavage
of the oligomers from the solid support were performed
according to standard procedures as described by
McBride, et al., Tetrahedron Letters, 24:245-248
(1983). The repetitive yield of the synthesis as measured
by the optical density of the removed protecting
group as recommended by Applied Biosystems (1984) was
greater than 97.5%.

The crude oligonucleotide mixture was purified
by preparative gel electrophoresis as described by the
Applied Biosystems protocols of November 9, 1984 (User
Bulletin No. 13). The acrylamide gel concentration
varied from 10 to 20% depending upon the length of the
oligomer. The purified oligomer was identified by UV
shadowing, excised from the gel and extracted by the
crush and soak procedure (Smith, Methods in Enzymology,
65:371-379 (1980)).



2. Sequencing of DNA:

DNA sequences were determined by the following
methods. Fragments containing the region of interest
were cloned into the multiple cloning site of M13mp18
or M13mp19 (Maniatis et al., 1982, and Norrander et
al., 1983). Single-stranded DNA was prepared and sequenced
by the primer extension method (Sanger et al.,
1977 and Biggin et al., 1983) using 35S-deoxyadenosine
5'-(alpha-thio)-triphosphate (New England Nuclear) as
label. In some cases, reverse transcriptase (Molecular
Genetics) was used to extend the primer, using the
dideoxy:deoxynucleosidetriphosphate ratios utilized by
Zagursky et al. (Gene Anal. Techn. (1985) 2:89-94).
Deoxyadenosine triphosphate labeled with either 32P or
35S was used in these reactions. Compression artifacts
which appeared in some G-C rich sequences were overcome
by eliminating deoxyguanosine triphosphate from the G
reaction, and using deoxyinosine triphosphate (P-L
Biochemicals) at a final concentration of 37.5 uM instead.
In the other mixes, the concentration of
dideoxyGTP in the G reaction was 0.5 mM. All sequences
were run on 6 or 8% polyacrylamide gels containing 8 M
urea (Sanger et al. 1978). Primers used for sequencing
were purchased from P-L Biochemicals. Storage and
analysis of data utilized software from both DNAstar
and International Biotechnologies, Inc.



3. In vitro mutagenesis of cloned DNA:

Plasmid DNA (1 ug) containing the sequence to
be mutated was digested in two separate reactions. One
reaction contained either one or two restriction endonucleases
which cleave at sites immediately flanking
the region of interest. In the second reaction, the
DNA was digested with a restriction endonuclease which
cleaves only once at a site distant from the sequence
to be mutated. The DNA fragments generated in the
first reaction were separated by agarose gel electrophoresis
and the large fragment which lacks the sequence
to be mutated was excised and purified. DNA
from the second reaction, the large fragment of DNA
from the first reaction, and a synthetic oligodeoxynucleotide
of 20-30 bases in length containing the
mutant sequence were mixed in a molar ratio of 1:1:250.
The mixture was denatured by heating at 100°C for 3 min
in 25 to 100 ul of 100mM NaCl, 6.5mM Tris-HCl pH 7.5,
8mM MgCl2, and 1mM β-mercaptoethanol. The denatured
mixture was reannealed by gradually lowering the temperature
as follows: 37°C for 30 min, 4°C for 30 min,
and 0°C for 10 min. The reaction was supplemented with
0.5mM deoxyribonucleotide triphosphates, 1 mM ATP, 400
units of T4 DNA ligase and 5 units of E. coli DNA polymerase
large fragment and incubated at 15°C for 12-16
hr. The reaction mixture was then transformed into E.
coli and antibiotic-resistant colonies were selected.
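
The 1:1:250 molar ratio used in the mutagenesis mixture can be converted into a mass of oligonucleotide with a short calculation. The sketch below is illustrative only. It assumes the commonly used average masses of about 650 g/mol per base pair of double-stranded DNA and about 330 g/mol per nucleotide of single-stranded DNA; the function names and the example fragment size are hypothetical and do not come from the specification.

```python
DS_G_PER_MOL_PER_BP = 650.0   # assumed average mass of one base pair
SS_G_PER_MOL_PER_NT = 330.0   # assumed average mass of one nucleotide


def dsdna_pmol(mass_ug, length_bp):
    """Picomoles of a double-stranded fragment of the given mass and length."""
    return mass_ug * 1e6 / (length_bp * DS_G_PER_MOL_PER_BP)


def oligo_mass_ng(pmol, length_nt):
    """Mass in ng of a single-stranded oligonucleotide."""
    return pmol * length_nt * SS_G_PER_MOL_PER_NT / 1000.0


def oligo_needed_ng(fragment_ug, fragment_bp, oligo_nt, molar_excess=250.0):
    """Oligonucleotide mass giving the stated molar excess over one fragment."""
    return oligo_mass_ng(molar_excess * dsdna_pmol(fragment_ug, fragment_bp),
                         oligo_nt)


if __name__ == "__main__":
    # Hypothetical example: 0.1 ug of a 3000 bp fragment and a 25-base oligomer.
    print(round(oligo_needed_ng(0.1, 3000, 25), 1), "ng of oligonucleotide")
```

Under these assumptions, 0.1 ug of a 3000 bp fragment corresponds to roughly 0.05 pmol, and a 250-fold molar excess of a 25-base oligomer is roughly 106 ng.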



Fermentation Conditions

The fermentor is a 15 L Chemap, 10 L working
volume. The culture conditions are: temperature =
30°C, pH = 6.8; NaOH 2.5 M is used for pH regulation.
The headspace pressure is below 0.1 bar. The dissolved
oxygen is regulated at 50%. The air flow varies from
0.5 L/min to 20 L/min. The agitation rate varies
between 200 and 1500 rpm.

The fermentor is inoculated with a 10% (v/v)
inoculum grown in medium A for 15 hours at 30°C under
agitation.

Medium B was the fermentor medium. The
starting volume was 5 L.

When the glucose concentration reached 1%, a
concentrated solution (5x) of medium B was added to the
fermentor in order to keep the glucose concentration
approximately at 1%. When the culture reached an OD600
of 60.0, the temperature was increased to 42°C for 10
min, then lowered to 39°C for 2.5 hours. The cells
were then harvested by centrifugation and frozen at
-70°C until processed.

Table 1



Medium A: LB Medium

Constituent          g/L
NaCl                 10
tryptone             10
yeast extract        5
kanamycin            5x10^-3

Medium B

Constituent          g/L
NH4Cl                4.5
KH2PO4               0.75
MgSO4·7H2O           0.3
K2SO4                0.09
CaCl2                24x10^-3
FeSO4·7H2O           7.5x10^-3
TE                   0.5 ml
casamino acids       25
yeast extract        5
glucose              20
kanamycin            5x10^-3

Example 2



Assembly and Expression of the SlpI Gene

1. Summary of the scheme for assembling the SlpI
gene:

An 18 bp DNA sequence that codes for the most
frequent repeating oligopeptide in the silk fibroin
protein made by Bombyx mori [Lucas, F. and K.M. Rudall
(1968) Extracellular Fibrous Proteins: The Silks. p
475-558, in Comprehensive Biochemistry, vol. 26, part
B., M. Florkin and E.H. Stotz (eds.) Elsevier,
Amsterdam] was synthesized in vitro. Two single-strands
were synthesized, annealed together and then
the resulting double-stranded segments were
multimerized head-to-tail to generate concatamers of up
to and exceeding 13 repeats. The structural gene for
silk I that we proceeded to work with had 13 repeats
that coded for the oligopeptide gagags, where g =
glycine, a = alanine and s = serine. We refer to this
structural gene as the "monomer". We constructed
"dimeric, trimeric, tetrameric, pentameric and
hexameric" SlpI genes containing 26 (SlpI-2), 39
(SlpI-3), 52 (SlpI-4), 65 (SlpI-5) and 78 (SlpI-6)
repeats. There is a short intervening sequence between
each monomer unit. The assembly is pictured as
follows:



Repeating DNA sequence    5'-GGTGCGGGCGCAGGAAGT
                                CGCCCGCGTCCTTCACCA-5'
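
The relationship between the repeating DNA unit, the hexapeptide it encodes and the repeat counts of the multimers can be checked with a few lines of code. The sketch below is illustrative only. It assumes that the top strand reads GGTGCGGGCGCAGGAAGT (the reading consistent with the gagags peptide) and that the lower strand is offset by three bases so that annealed units align head-to-tail; the function names are hypothetical, and the codon table holds only the six codons that occur in the unit.

```python
# Only the codons present in the 18-base repeat are listed here.
CODONS = {"GGT": "G", "GGC": "G", "GGA": "G",
          "GCG": "A", "GCA": "A", "AGT": "S"}

TOP_STRAND = "GGTGCGGGCGCAGGAAGT"   # assumed 5'->3' reading of the repeat
REPEATS_PER_MONOMER = 13


def translate(dna):
    """Translate an in-frame DNA string using the limited codon table."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


def complement(dna):
    """Base-by-base complement, written in the same left-to-right order."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))


def staggered_lower_strand(top, overhang=3):
    """Lower strand (written 3'->5') that pairs with the top strand after
    the overhang and with the first bases of the next repeat unit."""
    return complement(top[overhang:] + top[:overhang])


def repeats_in_multimer(units):
    """Number of hexapeptide repeats in a gene built from `units` monomers."""
    return REPEATS_PER_MONOMER * units


if __name__ == "__main__":
    print(translate(TOP_STRAND))
    print(staggered_lower_strand(TOP_STRAND))
    print([repeats_in_multimer(n) for n in range(1, 7)])
```

Under these assumptions the unit translates to GAGAGS, the offset lower strand is CGCCCGCGTCCTTCACCA, and the monomer through hexamer carry 13, 26, 39, 52, 65 and 78 repeats.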


2. Assembly of the "monomeric" SlpI structural gene:

The two single-strands shown above were synthesized
as previously described. The strands were
separately purified by gel electrophoresis, phosphorylated
using T4 polynucleotide kinase and then mixed
together and allowed to anneal. This resulted in the
double-stranded segments aligning spontaneously head-to-tail
in long concatamers. The phosphodiester bonds
between segments were formed with T4 ligase. The
reaction was stopped by filling in the terminal cohesive
ends using the Klenow fragment of DNA polymerase
I. The blunt-ended repeating DNA was then ligated to
the HincII REN site in plasmid vector pUC12 (Vieira, et
al., Gene 19:259-268 (1982)). The ligated DNA was
transformed into E. coli HB101 and transformants were
selected for their ability to grow in the presence of
ampicillin. The DNA of potential clones was analyzed
for size and orientation by REN digestion and gel electrophoresis.
DNA sequences were determined for isolates
with large inserts that were oriented properly.
The "monomer" clone selected for subsequent multimerization
had 13 repeats coding for the oligopeptide
agagsg, and was named pSY708. The DNA sequence, deduced
amino acid sequence and REN sites of the SlpI
insert and flanking regions of pSY708 are shown in
Table 2.



Table 2

(DNA sequence, deduced amino acid sequence and REN sites of the SlpI insert and flanking regions of pSY708; sequence listing not reproduced.)



3. Construction of the expression vector, pSY701:

Plasmid pSP65 (10 ug, Boehringer Mannheim) was
digested with AatII REN, phenol extracted and ethanol
precipitated. The DNA was resuspended in 10 ul of
H2O. One-half of this DNA was digested with
exonuclease III in the following mix: 5 ug DNA, 10 ul
10X exonuclease III buffer (600mM Tris-HCl pH 8.0,
6.6mM MgCl2, 10 mM β-mercaptoethanol) and 9 units of
exonuclease III in a total volume of 200 ul. Samples
of 20 ul were taken at 0, 1, 2.5, 5 and 7.5 min and
diluted immediately in 100 ul of the following buffer
(30 mM sodium acetate, pH 4.5, 0.25 M NaCl, 1 mM ZnSO4)
containing 5 ug tRNA and 36 units of S1 nuclease.
Incubation was at 30°C for 45 min and then the reaction
was terminated by the addition of 15 ul of stop buffer
(0.5M Tris pH 9.0, 125mM EDTA, 1% w/v SDS, 200 ug/ml
tRNA). The samples were phenol extracted and ethanol
precipitated. The resuspended DNA was digested with
SmaI REN and electrophoresed through a 1% gel of low
melting point agarose. The gel band corresponding to
the DNA fragment carrying the β-lactamase gene, the
plasmid origin and the β-galactosidase promoter was
excised from the gel and melted at 65°C. One volume of
H2O was added. The DNA in each sample (timepoint) was
recircularized by ligation in the presence of agarose.
The reaction included 8 ul melted gel, 2 ul of ligation
buffer (100mM Tris-HCl pH 7.5, 50mM MgCl2, 50mM DTT,
1mM ATP), 10 units T4 DNA ligase and was incubated at
15°C for 3 hr. Competent cells of JM101 were transformed
with the ligated DNA and transformants were
selected by growth on L broth plates containing ampicillin
(40 ug/ml). Plasmid DNA was prepared from four
transformants. The DNA was digested with BamHI REN,
labeled with 32P-dGTP using the Klenow fragment of DNA
polymerase I, digested with PvuI and then the smallest
fragment was gel purified. The fragment from one
transformant was sequenced using the Maxam and Gilbert
technique. The fragments of the other three plasmids
were further digested with TaqI and electrophoresed on
the same gel. The sequenced plasmid had a fusion
between the multiple cloning site and a position upstream
from the N-terminal ATG of β-lactamase. The
size of the BamHI-TaqI fragment of two of the other
plasmids indicated a fusion between the multiple
cloning site and the 4th amino acid of the β-lactamase
gene. The DNA and corresponding amino acid sequences
of the N-terminal region of the altered β-lactamase are
given below, along with a circular map of REN sites for
pSY701 (see Figure 1). The amino acid sequence of
Figure 1 is met-thr-met-ile-thr-pro-ser-leu-gly-cys-
arg-ser-thr-leu-glu-asp-pro-his-phe-arg-val-ala-leu-
ile-pro-phe-phe-ala-ala-phe-cys-leu-pro-val-phe-ala-
his.




4. Insertion of "monomer" SlpI from pSY708 into
pSY701:

Plasmid pSY708 was digested with HindIII, the
cohesive ends were filled in using the Klenow fragment
of DNA polymerase I and then digested with BamHI.
Plasmid pSY701 was digested with XbaI, filled in as
above and then digested with BamHI. The DNA fragment
from pSY708 and the backbone of pSY701 were then purified
by electrophoresis through a low melting temperature
agarose gel and purified with NACS (BRL) columns.
The appropriate fragments were mixed, ligated, and then
transformed into E. coli JM109. Transformed cells were
selected by growth on L plates containing ampicillin
(40 ug/ml), IPTG (5x10^-4 M) and XGAL (20 mg/ml). Transformants
were analyzed for plasmid contents and one
(pSY756) was selected for further study since it
carried the insert of the monomer SlpI-1 sequences in
the proper orientation, as determined by mapping of REN
sites. Although the entire DNA sequence was not determined
for pSY756, the junctions between the insert and
vector were verified as correct restriction sequences
for XbaI, upstream and BamHI, downstream.



5. Multimerization of the SlpI gene of pSY756:

Plasmid pSY708 was digested with the REN SmaI
and the DNA fragment carrying the coding sequence for
the polypeptide arg(ala-gly-ala-gly-ser-gly)13thr-leu-
glu-asp-pro (R(AGAGSG)13TLEDP) was purified as in 4
above. Plasmid pSY756 was digested with SmaI, deproteinized
and then ligated with the purified DNA
fragment from pSY708. Transformants of E. coli JM109
were selected on medium containing ampicillin. Clones
were found to contain 2 units (dimer pSY882), 3 units
(trimer pSY883), and 4 units (tetramer pSY915) of the
original monomer sequence of the pSY708 clone.
Similarly, pentamers and hexamers have also been
constructed. All of these plasmids are genetically
stable and produce the gly-ala peptide as a fusion with
β-lactamase.

6 . Expressioa of the Sip I gene fusion to the 3- 
lactamase protein : 

Synthesis in E. coli cells of the SlpI peptide as a fusion protein with β-lactamase was detected by immunoblotting (Western) analysis. Anti-"Slp" antibodies were raised against a synthetic silk peptide. Fusions between β-lactamase and SlpI were also detected with antibodies raised against the E. coli β-lactamase. As shown in Figure 2, this antibody reacts with dimers and trimers of SlpI fused to the E. coli β-lactamase. The SlpI insert precedes the fifth amino acid of the signal sequence for this enzyme. The β-lactamase antibody (Figure 2A) detects both the unprocessed fusion proteins and the processed mature enzyme, which appears as the major antigenic band in this figure at about the 25 kD position. The mobilities of all Slp-containing polypeptides are anomalously slow, and the proteins are not as large as they appear on the gels.

The anti-Slp antibody is also useful in detecting these fusion products. Lanes 2-5 of Figure 2B represent 4 separate clones that contain dimer fusions of SlpI with β-lactamase, while lanes 6 and 7 are from two clones containing trimer fusions. As can be seen, the antigenicity of the trimer is considerably greater than that of the dimer. It is known from prior experiments that fusion proteins containing only a monomer of SlpI are not detected at all with this anti-Slp antibody. The increased antigenicity of the trimer peptide allows it to be detected as a processed fusion with the β-lactamase signal peptide. The processed form is seen at about the 33 kD position in lanes 6 and 7 of Figure 2B. The appearance of normally processed β-lactamase mature enzyme (detected with β-lactamase antibody) as well as a peptide corresponding to the fusion between the SlpI-3 trimer and the signal peptide of β-lactamase (detected with gly-ala antibody) suggests that, despite the insertion of SlpI sequences within the signal sequence, normal proteolytic processing of the enzyme occurs in E. coli.

7. a. Expression of the SlpI gene by fusion to T7 genes:

The SlpI sequence has also been expressed as a fusion protein with both the gene 9 and gene 10 proteins from bacteriophage T7 in E. coli. The construction is diagrammed in Figure 3. Plasmid pSY915 (containing the SlpI-4 tetramer) was digested to completion with REN SalI and partially with BamHI. The DNA fragment containing the SlpI-4 tetramer was purified and then cloned in plasmid pSY114 (pG2 of Promega Biotech), which had been digested with RENs SalI and BamHI. From this intermediate plasmid, the tetramer insert of SlpI was removed with the RENs AccI and EcoRI. This fragment was then cloned in pSY533 (pBR322 containing the complete T7 gene 9 sequence; pAR441 of Studier et al. (1986)), which was digested with EcoRI and AsuII. In the resulting plasmid the SlpI tetramer is fused to the gene 9 translational reading frame near the C-terminus of gene 9. This plasmid was then used to transform E. coli strain 0-43 (strain HMS174 (λDE3) of Studier et al., 1986), which contains the T7 RNA polymerase gene inserted into the chromosome under transcriptional control of the IPTG-inducible β-galactosidase promoter. In this configuration, expression of the SlpI-4 sequence is dependent upon production of the T7 RNA polymerase, which itself is controlled by the IPTG-inducible β-galactosidase promoter. As shown in Figures 4B and 4C, when these cells are induced with IPTG a protein product of the gene 9/SlpI-4 fusion gene is synthesized and is detected with antibody to the synthetic Slp peptide. The fusion product migrates in the gel as if it were 82 kD in size. The size expected is only 65 kD. The anomalous mobility is characteristic of the unusual amino acid composition (rich in glycine and alanine) and is seen for all Slp-containing products.

In like manner, plasmid pSY633 (pAR2113 of Studier), containing the promoter region and the first 13 amino acids of the T7 gene 10 protein, was digested with REN BamHI, filled in with the Klenow fragment of DNA polymerase and then digested with REN EcoRI. Into this linearized plasmid was cloned the AsuII-EcoRI fragment of pSY533 containing the SlpI-4 tetramer. This ligation creates an in-frame fusion of the silk tetramer following the thirteenth amino acid of T7 gene 10. The latter fusion product may be used for spinning without further processing since the N-terminal 13 amino acids are only a small part of the large SlpI protein. Although the fusion product is about 30 kD in size, it has an anomalous mobility and migrates as if it were larger, 50 kD. This is shown in Figure 4A.
The plasmids pG9/SlpI-4 and pG10/SlpI-4 were further improved by inserting a kanamycin-resistance gene in the β-lactamase gene in the orientation opposite to the expression system. Thus, any low level expression from the T7 system does not lead to elevated β-lactamase activity. Such activity eliminated the ampicillin in the medium that was added to select for maintenance of the plasmid. When the ampicillin was depleted the plasmids were lost from the culture. The kanamycin-resistance gene circumvents this problem and represents a significant improvement in the T7 expression system, especially for large scale cultures. The kanamycin-resistance gene (originally from Tn903) was isolated from a plasmid pUC4K (Vieira, J. and J. Messing, 1982, Gene 19:259-268) as a HincII fragment.

7. b. Fermentation and purification of SlpI-4:

E. coli strain 0-43 carrying pSY997 was grown at 37°C, using a Chemap or a Braun fermentor, in 10 L of LB to an OD (Klett units) of 300 (3x10^ cells/ml). The T7 system was then induced with the addition of 3.5 mM IPTG. After 150 min the cells were concentrated 10x using a Millipore filter unit (Pellicon cassette system, 100,000 molecular weight cut-off filter). The cell suspension was then frozen at -70°C until processing.

The cell suspension was melted in a water bath at 42°C and lysed in a French press, and the lysate was spun at 125,000xg for 1 hour at 25°C. The cleared supernatant was treated with DNAase (250 µg/ml) for 15 min at room temperature, then filtered through a 0.45 µm sterile filter. The filtrate volume was measured and the filtrate was incubated in ice with slow stirring. Then 231 mg of ammonium sulphate were added for each ml of filtrate over a period of 45 min. One ml of NaOH for each 10 g of ammonium sulphate was added to neutralize the pH. After 2 hours of continuous stirring the mixture was spun at 9,000xg for 10 min. The pellet was resuspended in 1/10 of the original filtrate volume using distilled water. The centrifugation and resuspension were repeated three times. The pellet was resuspended in 1/10 of the original filtrate volume in distilled water. Samples were analyzed for protein concentration, amino acid composition and protein sequence by standard methods. This is one of several methods for obtaining the product. This method results in an SlpI-4 product that is greater than 90% pure. The amino acid composition is almost entirely gly, ala and ser, as expected, and the N-terminal amino acid sequence is that of the gene 10 leader.
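The salt and base additions in the purification above scale linearly with the measured filtrate volume. The following is a minimal sketch of that arithmetic, assuming only the two ratios stated in the text (231 mg of ammonium sulphate per ml of filtrate, and 1 ml of base, read here as NaOH, per 10 g of ammonium sulphate). The function name and the 500 ml example volume are illustrative and do not come from the source, and the concentration of the base is not given there.

```python
def ammonium_sulphate_additions(filtrate_ml: float) -> dict:
    """Return the salt and base amounts for a given filtrate volume in ml.

    Ratios are taken from the text; everything else is illustrative.
    """
    mg_salt_per_ml_filtrate = 231.0   # stated in the text
    ml_base_per_g_salt = 1.0 / 10.0   # 1 ml of base per 10 g of salt

    salt_g = filtrate_ml * mg_salt_per_ml_filtrate / 1000.0
    base_ml = salt_g * ml_base_per_g_salt
    return {"ammonium_sulphate_g": salt_g, "base_ml": base_ml}


if __name__ == "__main__":
    # Hypothetical filtrate volume chosen only to show the calculation.
    amounts = ammonium_sulphate_additions(500.0)
    print(f"ammonium sulphate: {amounts['ammonium_sulphate_g']:.1f} g")
    print(f"base:              {amounts['base_ml']:.2f} ml")
```

The same ratio reappears later in the document as 231 g/l for 38% saturation, so the helper could be reused there by passing the volume in ml.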

8. Controlled expression of the T7 RNA polymerase gene in Bacillus subtilis:

The coding sequence of the T7 RNA polymerase gene (T7 gene 1, T7 nucleotides 3' 23 to 5645) from plasmid pSY553 (pAR1151 of Studier et al., 1985) was modified by in vitro mutagenesis of cloned DNA. We inserted the recognition sequence for the restriction endonuclease NdeI at position 3171. Using an oligodeoxynucleotide which was synthesized as previously described, the T7 gene 1 sequence was changed from its natural sequence, TAAATG, to the modified sequence, CATATG.
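The effect of this mutagenesis can be checked with a few lines of code. The sketch below compares the two hexamers quoted in the text and confirms that the change creates an NdeI recognition sequence (CATATG) while leaving the ATG start codon untouched. The helper names are illustrative and are not part of the source.

```python
NDEI_SITE = "CATATG"  # NdeI recognition sequence


def has_ndei_site(seq: str) -> bool:
    """True if the sequence contains an NdeI recognition site."""
    return NDEI_SITE in seq.upper()


def substitutions(before: str, after: str) -> list:
    """List (position, old base, new base) for each changed base, 1-based."""
    if len(before) != len(after):
        raise ValueError("sequences must have the same length")
    return [
        (i + 1, old, new)
        for i, (old, new) in enumerate(zip(before, after))
        if old != new
    ]


natural = "TAAATG"   # natural T7 gene 1 sequence, from the text
modified = "CATATG"  # sequence after mutagenesis, from the text

if __name__ == "__main__":
    print("NdeI site before:", has_ndei_site(natural))
    print("NdeI site after: ", has_ndei_site(modified))
    print("base changes:    ", substitutions(natural, modified))
    print("start codon kept:", modified.endswith("ATG"))
```

Only two bases differ between the hexamers, and both lie upstream of the ATG, which is why the coding sequence of the polymerase is unaffected.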

Similarly, the upstream regulatory sequence of the Bacillus subtilis gene spoVG, obtained from plasmid pCB1291 (Rosenblum et al., J. Bacteriology, 148:341-351 (1981)), was modified by in vitro mutagenesis at position 35 (Johnson et al., Nature, 302:800-804 (1983)) such that it also includes an NdeI cleavage site. The upstream regulatory sequences of the spoVG gene were then ligated with the coding sequence of the T7 RNA polymerase gene via these novel NdeI cleavage sites. After transformation of E. coli HB101, the plasmid contents of individual ampicillin-resistant isolates were checked by restriction mapping. The correct construction was named pSY549.

Plasmid DNA containing the spoVG:T7 RNA polymerase fusion gene (pSY549) was further modified to include a chloramphenicol-resistance gene that functions in B. subtilis. First the NdeI to SalI fragment of about 1200 base pairs from plasmid pGR71-P43 (Goldfarb et al., Nature, 293:309-311 (1981)) was isolated. This fragment carries the P43 promoter of B. subtilis and an adjacent chloramphenicol acetyltransferase gene from Tn9. After filling in all the cohesive ends using the Klenow DNA polymerase reaction, this fragment was inserted into the XbaI site within the multiple-cloning site of pUC13 (Vieira et al., Gene, 19:259-268 (1982)). Ampicillin- and chloramphenicol-resistant transformants were selected for further use. The correct plasmid construction was named pSY530. The SmaI to HincII endonuclease cleavage fragment from plasmid pSY530 containing the chloramphenicol acetyltransferase gene fused to the P43 promoter sequence was gel purified and blunt-end ligated to the PvuI site of plasmid pSY549 that had been treated first with T4 DNA polymerase. The resulting plasmid, pSY856, was then transformed into B. subtilis I168. Because plasmid pSY856 is unable to replicate autonomously in B. subtilis, stable transformants resistant to chloramphenicol must result from the integration of the plasmid into the B. subtilis chromosome (Ferrari et al., J. Bacteriology, 154:1513-1515 (1983)). The integration event, facilitated by homologous recombination, most likely occurred at either the spoVG or the P43 loci of the bacterial chromosome (pSY856 contains DNA sequences homologous to the B. subtilis chromosome at only these two sites). The resulting strain, "BIPoL", was grown both in the presence and absence of chloramphenicol in order to determine the stability of the selectable marker. Expression of the T7 polymerase was obtained with no apparent effect on the growth or viability of this strain.

9. a. Expression of a plasmid-borne target gene (kanamycin-resistance) in B. subtilis strain BIPoL:

The Staphylococcus aureus plasmid pUB110 (Lacey et al., J. Med. Microbiology, 7:285-297 (1974)), which contains the gene coding for resistance to the antibiotic kanamycin, was used to test the expression of the growth-regulated spoVG:T7 RNA polymerase gene of strain BIPoL. An EcoRI-BamHI fragment of phage T7 DNA (positions 21,402 to 22,358) containing the T7 gene 9 promoter sequence was purified from plasmid pAR441 (Studier et al., 1986). This DNA fragment was ligated into pUB110 between the EcoRI and BamHI restriction endonuclease sites. The resulting plasmid, pSY952, contains the T7-specific promoter in the same orientation as the kanamycin-resistance gene. Plasmid pSY952 was transformed into B. subtilis I168 and BIPoL and these strains were analyzed for the level of expression of the polypeptide encoded by the plasmid-derived kanamycin-resistance gene. Approximately 10" cells from growing cultures of I168, I168 containing pUB110, I168 containing pSY952, BIPoL, BIPoL containing pUB110, and BIPoL containing pSY952 were obtained at several times during the growth and sporulation cycle. The proteins in these cell samples were processed and analyzed by polyacrylamide gel electrophoresis.

Because the rate of transcription from the spoVG promoter increases as a function of cell density and reaches a maximum during early sporulation, an accelerated accumulation of the target protein is expected in the BIPoL strain containing pSY952 during growth as the culture enters sporulation. The results
increases in abundance as the culture approaches and enters stationary phase. The size of the protein is in agreement with the predicted size of the kanamycin-resistance gene product (Sadaie et al., J. Bacteriology, 141:1178-1182 (1980)) encoded in pSY952. This protein is not present in BIPoL or I168 containing pSY952, which lacks the spoVG-regulated T7 RNA polymerase gene, or in BIPoL containing pUB110, which lacks the T7 promoter sequence. The maximum accumulated level of target protein after 24 hours of growth in BIPoL containing pSY952 was 20% of the total cellular protein as determined by densitometry.

Expression of SlpI-4 in B. subtilis:

Plasmid pG10SlpI was digested with EcoRI REN. After filling in the cohesive ends using the Klenow DNA polymerase reaction, the DNA was digested with BglII REN. Plasmid pSY662 was digested with SmaI and BamHI RENs. The two plasmids were then purified by electrophoresis through a low melting temperature agarose gel and purified with NACS (BRL) columns. The DNA fragment of pG10SlpI was ligated to the backbone of pSY662 and transformed into E. coli, with selection on medium containing ampicillin (40 µg/ml). Transformants were analyzed for plasmid contents and one (pSY662/G10/SlpI-4) was selected for further study.

Competent cells of B. subtilis BIPoL were transformed with pSY662/G10/SlpI-4 and incubated at 37°C with shaking for 90 min. The transformation mixture was then diluted 1:100 in fresh LB containing 10 µg/ml of tetracycline and incubated at 37°C with shaking. Samples were taken and equal numbers of cells were lysed and loaded on gels for separation by SDS-PAGE. Immunoblot analysis was performed using anti-Slp antibodies to detect the synthesis of the gene 10/SlpI-4 fusion protein.
The expression of the SlpI-4 polypeptide in B. subtilis was detected by its seroreactivity with anti-Slp antibody, after transfer of the cellular proteins from the polyacrylamide gel to a nitrocellulose filter. We verified that the seroreactive protein was the product of the SlpI-4 gene by exhaustively treating the cellular proteins with CNBr. This should cleave after methionine residues, but since SlpI-4 lacks methionine it will remain intact. The CNBr treatment eliminated greater than 96% of the proteins stainable with Coomassie blue dye. As expected for a protein lacking methionine, SlpI-4 remained intact and still reacted with anti-Slp serum.
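The logic of the CNBr test can be illustrated in code. The sketch below models CNBr as cutting after every methionine in a one-letter protein sequence. The Slp repeat unit is the R(AGAGSG)13TLEDP sequence given earlier in this document; the methionine-containing control sequence is invented purely for illustration, and the model ignores incomplete cleavage and side reactions.

```python
def cnbr_fragments(protein: str) -> list:
    """Split a one-letter protein sequence after every methionine (M)."""
    fragments = []
    current = ""
    for residue in protein.upper():
        current += residue
        if residue == "M":
            fragments.append(current)
            current = ""
    if current:
        fragments.append(current)
    return fragments


# Repeat unit quoted earlier in the document; it contains no methionine.
slp_unit = "R" + "AGAGSG" * 13 + "TLEDP"

# Hypothetical methionine-containing protein, not taken from the source.
control = "MKTAYIAKMRQISFVKSHFM"

if __name__ == "__main__":
    print("Slp unit fragments:", len(cnbr_fragments(slp_unit)))
    print("control fragments: ", cnbr_fragments(control))
```

A sequence with no methionine comes back as a single fragment, which mirrors the observation that the Slp product survives treatment while most cellular proteins do not.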

Example 3

Assembly and Expression of the SlpIII Gene

1. Summary of the scheme for assembling the SlpIII gene:

The synthetic SlpIII gene codes for a protein similar to that encoded by the SlpI gene and to the crystalline region of the silk fibroin protein made by the silkworm, Bombyx mori. SlpIII more closely resembles the silk fibroin molecule because it includes the amino acid tyrosine at regular intervals (about 60 residues), whereas multimers of SlpI do not. The SlpIII gene was assembled from smaller parts. First, three double-stranded sections of DNA of about 60 bp in length were chemically synthesized. Each section was cloned by insertion into bacteriophage M13 and the DNA sequence was verified. These sections were then removed from the vector and linked together in a specific order. This linkage of about 180 bp is named the SlpIII "monomer". "Monomers" were then linked in a specific order to yield dimers, trimers, tetramers, etc., of SlpIII. The multimers were then cloned either directly into plasmid expression vectors to detect the SlpIII protein or initially into an adapter plasmid. Insertion of the SlpIII DNA into the adapter allows for further gene manipulation and is further described later. The assembly scheme is pictured as follows:

Synthesis of double-stranded DNA sections

Section 1

Section 2

Section 3

The DNA and corresponding amino acid sequences of the three sections of the SlpIII gene are shown in Table 3.
The double-stranded DNA sequence is shown in the 5' to 3' direction. The amino acids (g = glycine, a = alanine, s = serine, y = tyrosine) coded by the sequence are shown immediately below each section. Recognition sequences for cleavage by restriction endonucleases are shown above each section.

The above six single-strands were synthesized. After synthesis, the strands of DNA were purified and the homologous strands were annealed. About 1 µl (0.5 µg) of each strand was mixed with 2 µl of 10X AA (description) buffer and 16 µl of sterilized deionized water in a 1.5 ml polypropylene Eppendorf tube. The tube was placed in a boiling water bath (500 ml in a 1 liter beaker) for 10 min and then the beaker was removed from the hot plate and allowed to cool on the bench to room temperature. This required about 1-2 hr.

Each of the three double-stranded sections was cloned separately into M13mp18. Section 1 was ligated between the SmaI and BamHI restriction sites of the multiple-cloning site. Section 2 was ligated between the BamHI and PstI sites. Section 3 was inserted between the PstI and HindIII sites. The respective clones are: M13mp18.1, M13mp18.2, M13mp18.3. The DNA sequence was determined for each cloned section. One representative of each section that had the correct DNA sequence was recovered and became the material for the next step: assembly of the "monomer".

3. Assembly of the "monomer" of SlpIII:
The DNA sections 2 and 3 were isolated by digestion of the M13 clones with restriction enzymes: for section 2, M13mp18.2 was digested with BamHI and PstI; for section 3, M13mp18.3 was digested with PstI and HindIII. The two sections were purified and mixed together in equal molar amounts with M13mp18.1 that had been first digested with BamHI and HindIII. T4 DNA ligase was added to link the homologous overlapping ends in the order 1-2-3. Due to the hybridization specificity of the cohesive ends, the three sections are efficiently linked in only this order. The DNA sequence of the cloned "monomer" in the assembly named M13mp18.1.2.3 was determined to be correct and as shown in 2 above.
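The claim that the cohesive ends permit only the order 1-2-3 can be illustrated with a small enumeration. The sketch below is a simplified model, not the authors' procedure: each fragment end is labelled by the enzyme that produced it, two ends are treated as joinable only when the labels match, and every order and orientation of the two purified inserts is tried between the free ends of the digested vector. The fragment names and the function are illustrative.

```python
from itertools import permutations, product

# Free ends of the linearized vector M13mp18.1 after the BamHI/HindIII digest.
# The BamHI end sits next to section 1, which stays in the vector.
VECTOR_LEFT_END = "BamHI"
VECTOR_RIGHT_END = "HindIII"

# Purified inserts as (name, left end, right end), from the digests in the text.
INSERTS = [
    ("section 2", "BamHI", "PstI"),
    ("section 3", "PstI", "HindIII"),
]


def flip(fragment):
    """Return the fragment in the opposite orientation."""
    name, left, right = fragment
    return (name + " (inverted)", right, left)


def valid_assemblies(inserts, left_end, right_end):
    """Enumerate every order and orientation whose junctions all match."""
    results = []
    for order in permutations(inserts):
        for flips in product([False, True], repeat=len(order)):
            chain = [flip(f) if inv else f for f, inv in zip(order, flips)]
            ends = [left_end]
            for _, left, right in chain:
                ends.extend([left, right])
            ends.append(right_end)
            # Junctions are the consecutive pairs (0,1), (2,3), ...
            if all(ends[i] == ends[i + 1] for i in range(0, len(ends), 2)):
                results.append([name for name, _, _ in chain])
    return results


if __name__ == "__main__":
    for assembly in valid_assemblies(INSERTS, VECTOR_LEFT_END, VECTOR_RIGHT_END):
        print("section 1 (in vector) ->", " -> ".join(assembly))
```

Of the eight order and orientation combinations, only section 2 followed by section 3, both in forward orientation, closes the vector in this model.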



4. Multimerization of the "monomer" of SlpIII:
In order to prepare large amounts of the "monomer" structural gene we first subcloned the "monomer" into the plasmid vector pUC12. M13mp18.1.2.3 was digested with EcoRI and HindIII restriction enzymes. The SlpIII "monomer" was gel purified and ligated into pUC12 digested with EcoRI and HindIII. The resulting plasmid DNA was prepared, the "monomer" was released from the vector by digestion with BanI REN and the fragment was gel purified.

To create multimers, "monomer" DNA fragments with BanI ends were linked by ligation. The nonpalindromic terminal BanI recognition sequence allows linkage only in a head-to-tail order. The extent of multimerization is monitored by gel electrophoresis and staining the DNA with ethidium bromide. Multimers of more than 20 units have been obtained by this method.
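Why a nonpalindromic end enforces head-to-tail linkage can be shown with a short check. The sketch below relies on general knowledge of the enzyme rather than on the source: BanI recognizes the degenerate sequence GGYRCC (Y = C or T, R = A or G) and cuts after the first G, leaving a four-base 5' overhang. The text does not say which variant of the site was used, so the code enumerates all four and flags those whose overhang is not its own reverse complement. Such an overhang cannot pair with an inverted copy of itself, which rules out head-to-head and tail-to-tail joins.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def bani_sites() -> list:
    """All concrete sequences matching GGYRCC (Y = C/T, R = A/G)."""
    return ["GG" + y + r + "CC" for y in "CT" for r in "AG"]


def overhang(site: str) -> str:
    """Four-base 5' overhang left by cutting after the first G."""
    return site[1:5]


def is_palindromic_overhang(site: str) -> bool:
    """True if the overhang can pair with an inverted copy of itself."""
    end = overhang(site)
    return end == reverse_complement(end)


if __name__ == "__main__":
    for site in bani_sites():
        kind = "palindromic" if is_palindromic_overhang(site) else "nonpalindromic"
        print(site, overhang(site), kind)
```

Two of the four variants give nonpalindromic overhangs, so a monomer flanked by one of those can only polymerize in a single orientation.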

5. Cloning of the multimers of SlpIII:

Plasmid pCQV2 (Queen et al., J. Appl. Mol. Gen., 2:1-10 (1983)) was digested with EcoRI and BamHI restriction endonucleases and a fragment of about 900 bp was purified. This DNA fragment contains the bacteriophage lambda cI-857 repressor gene, the closely linked rightward promoter, and the beginning of the cro gene. Plasmid pSY335 (described as pJF751 in Ferrari et al., J. Bacteriology, 161:556-562 (1985)) was digested with EcoRI and BamHI restriction enzymes and subsequently ligated to the DNA fragment of approximately 900 bp of pCQV2. The plasmid obtained from this construction, pSY751, expresses the β-galactosidase gene at 37°C and 42°C, but not at 30°C.

In this approach the SlpIII gene is first cloned into an "adapter" sequence in an intermediate plasmid and then subcloned to the expression systems. The adapter sequence has the following useful features: a unique central BanI REN site, three unique REN sites to either side of BanI, information coding for protein cleavage at either methionine, aspartate-proline or arginine amino acids, and small size. The BanI site is the point of insertion for the SlpIII multimers with BanI ends.

The adapter was synthesized with the Applied Biosystems 380A Synthesizer, cloned in M13mp18 and the DNA sequence verified. The adapter was then subcloned into a specially-constructed plasmid vector that lacked BanI REN sites. The recipient plasmid was made as follows. Plasmid pJH101 (Ferrari et al., 1983) was partially digested with AhaIII restriction enzyme and religated. Transformants of E. coli HB101 were selected on medium containing chloramphenicol (12.5 mg/ml). After restriction analysis of several isolates one plasmid was chosen, pSY325 (Figure 7). This plasmid contains only the chloramphenicol-resistance gene and the replication origin (from pBR322) of pJH101. After digestion to completion with XhoII, pSY325 was ligated with the gel-purified adapter. The result was the adapter-plasmid, pSY937, and its new REN sites were verified.

The SlpIII multimers were cloned into the BanI site of pSY937 (Figure 7). Positive clones were identified by colony hybridization with the lower strand of section 1 of SlpIII as the DNA probe for hybridization (probe sequence shown in Table 2). Positive clones were characterized by gel electrophoresis for the size of the inserted multimer. Finally, the SlpIII sequences were subcloned using the REN sites in the flanking adapter regions to specific locations of expression plasmids.

The SlpIII protein had the following amino acid composition:

SlpIII    1178 aa    MW 83,000

(fM) DPVVLQRRDWENPGVTQLNRLAAHPPFASDPM
GAGS (GAGAGS)6 GAAGY
[(GAGAGS)9 GAAGY]18
GAGAGSGAGAGSGAGAMDPGRYQLSAGRYHYQLVWCQK

(fM) intends the initiation codon.
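The repeat notation for the SlpIII protein can be expanded and checked mechanically. The sketch below rests on one reading of the printed sequence, which is an assumption: the leader and tail are spelled as in the code, and the repeat counts are 6, 9 and 18. Under that reading the expanded sequence has 1178 residues, each bracketed repeat has 59 residues, and glycine, alanine and serine dominate the composition. The variable names are illustrative.

```python
from collections import Counter

# One reading of the printed sequence; the spellings and repeat counts
# (6, 9, 18) are assumptions of this sketch.
LEADER = "M" + "DPVVLQRRDWENPGVTQLNRLAAHPPFASDPM"  # fM plus leader
REPEAT = "GAGAGS" * 9 + "GAAGY"                     # one bracketed repeat
TAIL = "GAGAGS" * 2 + "GAGA" + "MDPGRYQLSAGRYHYQLVWCQK"


def slpiii_sequence() -> str:
    """Expand the repeat notation into a full one-letter sequence."""
    return LEADER + "GAGS" + "GAGAGS" * 6 + "GAAGY" + REPEAT * 18 + TAIL


if __name__ == "__main__":
    seq = slpiii_sequence()
    counts = Counter(seq)
    gas = counts["G"] + counts["A"] + counts["S"]
    print("length:", len(seq))
    print("repeat length:", len(REPEAT))
    print(f"Gly+Ala+Ser fraction: {gas / len(seq):.2f}")
```

A 59-residue repeat corresponds to 177 bp of coding DNA, in line with the monomer size of about 180 bp described for the gene assembly.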

SlpIII Expression Vector

Plasmid DNA pSY1086 is a pSY937 derivative containing 19 repeats of SlpIII (3.5 kb). This plasmid DNA was digested with NruI and PvuII and the fragments separated by agarose gel electrophoresis. The purified SlpIII multimer was then cloned in plasmid pSY751 digested with PvuII REN. Several clones were analyzed and one (pSY1008) was chosen to be used in expression experiments and SlpIII purification.

The ampicillin drug resistance gene of pSY1008 was substituted with the kanamycin marker from pSY1010 (produced by digestion of pSY633 with DraI and SspI and insertion of KanR obtained by HincII digestion of pUC4K) and the subsequent plasmid was called pSY1186. By removing the SlpIII portion of plasmid pSY1186 with BanI, a new plasmid, pSY1252, was generated. This plasmid contains a unique BanI site which allows for the direct ligation of fragments containing BanI ends obtained by polymerization of monomers. This plasmid has been used to generate plasmids containing inserts for the following proteins: SELP1, 2, 3, and Slp4.
Production and Purification of SlpIII



Cell 



cells are cultured in the following medium:
glucose

An overnight culture (500 ml - 1 l) which had been grown at 30°C was used to inoculate 375 L of media contained in a 500 L fermentor. Fermentor conditions include a tachometer reading of 100 rpm, vessel back pressure of 5 psi and an air flow of 170 l/min in order to maintain dissolved O2 at greater than 50%.

Glucose (1 gm/l) and ampicillin (0.05 g/l) were added to the fermentation when the culture reached an OD of 1.0 and again at 2.0. When the culture reached an OD of 2.0 the temperature was increased
to 42°C for 10 min and then lowered to for 2 

hours. The culture was then chilled to 10°C and cells were harvested by centrifugation in a continuous centrifuge and frozen at -70°C until processed. Yields from two separate fermentations were 7.3 kg and 5.2 kg wet weight of cells.

It should be noted that other media can be used and, with different plasmids, various selection conditions can be imposed (i.e., substitution of kanamycin selection for ampicillin). These conditions have been used in laboratory scale fermentations (10 L volumes).



WO 88/03533    PCT/US87/02822



Cell Lysis

Method 1. Cells are thawed and suspended to a concentration of 1 kg wet weight/6 l in 50 mM Tris-HCl pH 7.0, 1 mM EDTA and broken by 2 passages through an APV Gaulin cell disrupter at 8000 psi. During this lysis procedure the cells are kept cold with an ice bath. The cell lysate is then centrifuged at 26,000xg with a continuous centrifuge, such as the TZ-28 rotor in a Sorvall RC5B refrigerated centrifuge operated at 4°C. Under these conditions greater than 90% of the SlpIII produced can be found in the pellet. The supernate does contain some product which can be recovered by NH4SO4 precipitation as described below. The pellet is extracted with LiBr as described below.

Method 2. Frozen cells are thawed and resuspended to a concentration of 1 kg wet weight/6 l in 50 mM Tris-HCl pH 7.0, 10 mM EDTA, and 5 mM PMSF to inhibit protease activity. Cells are stirred in this buffer at room temperature for 0.5 to 2 hours, then lysozyme is added to a concentration of 1 g/l and incubation is continued for 20 min. β-Mercaptoethanol is then added to 70 mM and the detergent NP40 is then added to a final concentration of 1% for 20 min while continuously stirring the cell suspension. Then MgCl2 is added to 50 mM followed by DNAse at a concentration of 1 mg/l and incubation is continued at room temperature for 20 min. The cell lysate is then centrifuged as in method 1 at 26,000xg in a continuous centrifuge and the supernatant is collected and passed through the continuous centrifuge a second time at 26,000xg. The supernate resulting from this second centrifugation contains <5% of the total SlpIII, but what is there can be recovered with NH4SO4 as described below. The pellets resulting from the 1st and 2nd 26,000xg centrifugations are combined and extracted with LiBr as described below.




Method 3. For this method, a strain of E. coli is used that contains a second plasmid which encodes the T7 phage lysozyme. This plasmid is compatible with the plasmid encoding the SlpIII gene and the drug resistance determinant. The strain is grown in the same medium and under the same conditions as in the first two methods. However, due to the production of the T7 lysozyme inside the cells, their cell wall is weakened and they can be easily lysed at the completion of the fermentation by the addition of EDTA to >100 mM and NP40 to a concentration of from 0.5 to 1.0% v/v. Lysis can also be achieved by the addition of chloroform (20 ml per liter of fermentation broth) instead of NP40. Alternatively, cells may be collected by centrifugation prior to lysis, resuspended to 1 kg wet weight/6 l in Tris-EDTA as described in the first two methods and then lysed by the addition of NP40 or chloroform. Following cell lysis by either method the lysate is centrifuged in a continuous rotor at 26,000xg as described in the first two methods. As with those methods, LiBr extraction of the pellet and NH4SO4 precipitation of the supernate are used to recover the product.

Purification of SlpIII

The pellet obtained by centrifugation of the cell lysate at 26,000xg as described above is extracted with an equal volume of 9M LiBr. The salt solution is added and the pellet is evenly suspended by stirring at room temperature (RT). The mixture is stirred for 1 hour at RT after an even suspension is obtained. The mixture is then centrifuged at 26,000xg in a continuous rotor at 4°C or at RT to generate a pellet and a supernatant fraction. The supernate is saved and the pellet is re-extracted with another equal volume of 9M LiBr as above. After mixing for 1 hour the mixture is centrifuged at 26,000xg and the supernate from this centrifugation is combined with the supernate from the first LiBr extraction and allowed to stand at 4°C overnight. Approximately 90% of the SlpIII contained in the cell lysate 26,000xg pellet is extracted by LiBr using this procedure.

After the LiBr extract stands overnight at 4°C a precipitate forms, is removed by centrifugation at 26,000xg and is discarded. The supernate is then placed in dialysis bags and dialyzed against several changes of dH2O for 2 days. As the LiBr is removed by dialysis the SlpIII product precipitates in the dialysis bags. The precipitate is collected by centrifugation and washed 2-3 times with dH2O. The final washed product is centrifuged and dried by lyophilization.

For the recovery of SlpIII from the 26,000xg supernatant fractions, NH4SO4 precipitation is used. Solid NH4SO4 is slowly added to the sample, which is maintained at 4°C, until 38% saturation is achieved (231 g/l). The mixture is then stirred at 4°C for 2-3 hours. The precipitate is recovered by centrifugation in a continuous flow centrifuge and washed 4-5 times with an equal volume of distilled H2O or with 0.5% SDS in H2O. After each wash the precipitate is recovered by continuous centrifugation. The pellet becomes increasingly white with successive washes as contaminating protein is removed. SlpIII is recovered as a washed pellet and can be dried by lyophilization.

Trypsin Treatment Step of SlpIII

SlpIII was suspended in 50 mM Tris HCl, pH 8.0, 0.1 M NaCl buffer, and was placed in a 37°C water bath, and TPCK-treated trypsin solution was mixed into the suspension. The final trypsin concentration was 0.1%. After 3 hours, the solution was centrifuged at 15,000xg for 15 min. The pellet was washed with a half equal volume of 0.5% SDS in H2O first, then with distilled water. After each wash the pellet was recovered by centrifugation. The final product was resuspended in water and kept at 4°C for further analysis.

With the trypsin treatment, SlpIII was purified to 99.4% purity.

Physical Measurements of SlpIII

Physical measurements of the purified silk-like proteins have been compared with those of Bombyx mori silk in order to establish that the repetitive amino acid polymers produced microbiologically accurately mimic the properties of naturally occurring polymers. Physical measurements were performed to confirm the model of anti-parallel chain pleated sheet conformation for the crystalline regions of Bombyx mori silk fibroin (Marsh, Corey and Pauling, Biochem. Biophys. Acta (1955) 16; Pauling and Corey, Proc. Natl. Acad. Sci. USA (1953) 39:247). Preliminary analysis of x-ray diffraction patterns obtained from Slp films is consistent with those described by Fraser, MacRae, and Steward (1966) (Table 4). Circular Dichroic (CD) and Fourier transform infrared (FTIR) spectroscopic analyses of SlpIII are consistent with a high degree of extended β and β-turn conformations. Comparisons of the spectra obtained from SlpIII with those of naturally occurring silk fibroin in various solvents (Isuka and Young, Proc. Natl. Acad. Sci. USA (1966) 55:1175) indicate that SlpIII in solution consists of a mixture of the random and highly ordered structures seen in silk fibroins.
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. 3 Example ^ 

S3SI Gene Cg r.s t ruG - 1 o r. : 

Six olisonuclec- ide strands were s ynt i-ea i zed 
and purified as deaaribed previously . 

(EBSI) ScaI BanII

iii.

iv.

fTGTAGGCGTTCCGCCAG l iGG

5-SHSlCACAC-rACaCCTCGAACACCCACrCCAGGTACACCAACTCCCaGA
ACCCCTACACCrCGAACTCCrACACCrOiiiuA^Aj

v.

. A r-,- A r -AGCCIGTAG^SlCCrTCGAGGGGGTGGrGGTAG

j-CCeLGGGTAGGAGTACCAGGGGTA GvCG '^- -^ —J-^^t
CGGCGCAGGCGC--vgC7CGGGAGTAGGCa±i^^^




Oligonucleotide strands (iii), (iv), (v) and
(vi) were annealed and ligated with the DNA of plasmid
pBSm13(+) (Stratagene) which had been digested with
HindIII and EcoRI. The products of this ligation
reaction were transformed into E. coli strain JM109.
Transformant colonies were selected for resistance to
ampicillin. Colonies were screened for their hybridization
with 32P-labelled oligonucleotides (iii), (v).
Plasmid DNA from several positively hybridizing clones
was purified and sequenced. Two of the plasmids,
pSY1292 and pSY1293, contained the sequence shown for
oligonucleotides (iii), (v) and (iv), (vi). These
sequences contained all of the nucleotides present in
the synthetic oligonucleotides except one. A G:C
basepair was missing at position 7 (iii). The lack of
this basepair obstructed one of the BanII sites. In
order to introduce a second BanII site at the 5' end of
the gene fragment, oligonucleotides (i) and (ii) were
annealed and ligated with plasmid pBSm13(+) which had
been digested with HindIII and EcoRI. Plasmid DNA from
the transformant colonies resistant to ampicillin was
purified. Two plasmids, pSY1295 and pSY1296, which
were digestible with StuI, a unique site contained in
the oligonucleotide sequence, were sequenced. They
were both shown to contain the sequence shown for
oligonucleotides (i) and (ii). Plasmid DNA from
pSY1292 was digested sequentially with HindIII, S1
nuclease, and EcoRI. The digestion products were
separated by electrophoresis in an agarose gel and the
DNA fragment of approximately 150 basepairs was excised
from the gel. This fragment was ligated with
plasmid DNA pSY1296 which had been digested with StuI
and EcoRI. The products of this ligation reaction were
transformed into E. coli strain JM109 and were selected
for resistance to ampicillin. Colonies were screened
for hybridization to 32P-labelled oligonucleotide (v).
Plasmid DNA from two positively hybridizing clones
was purified and sequenced. These plasmids were named
pSY1297 and pSY1298. They contained the following
sequence:



(HindIII) BanII

i^C:::2a^CTCTGaAGTAGCTGrCCCAGGTGTAGCAGTTCC::c:CTGTAGGCG':'rCC0GGAG £ :
TCGAGCGGAGAGGTCATCCACACGGTCCACATCCrCAAGGGCCACAr;

TTGGTGTACGTGGAGTGGGTGTTCCAGGCGTAGGTGTGCGGGGGG'rAGG AGTACCAGGGG 2.2 0
A.^GGACArGGAGGTCACOGACAAGGTCCGCATCCACACGGGGCCCA'rCC'rGAOGGTCCGG

EcoRI
GCTTAAG



EBSI Multimer Gene Assembly:

The BanI acceptor plasmid pSY937 was modified
in order to accept BanII terminal cohesive DNA
fragments. Two oligonucleotides were synthesized for this
purpose.



vii.  ^ T7A~CrT ATC TTTAAATATT C TCGCGAA CGTrTTrGTA7 COCCTC GATGTG l

viii. 2Ci.CGC-TAACACATC^A.i^I^ATACAAAAA
      CG — rCCG A G A ATAT TT AAA CATAG 3'
      NruI SspI DraI

Oligonucleotides (vii) and (viii) were
annealed and ligated with plasmid DNA pSY937 which was
digested with BamHI. The products of this ligation
were transformed into E. coli strain JM109 and colonies
were selected for resistance to chloramphenicol.
Transformant colonies were screened by hybridization
to 32P-labelled oligonucleotide (vii). Plasmid DNA
from two positively hybridizing clones, pSY1299 and
pSY1300, contained the sequence shown for
oligonucleotides (vii) and (viii), as determined by DNA
sequencing.

Plasmid pSY1298 was digested with BanII
and the digestion fragments separated by agarose gel
electrophoresis. The EBSI gene fragment, approximately
150 base pairs, was excised and purified by
electroelution and ethanol precipitation. Approximately 1 ug
of purified fragment was self-ligated in order to
produce multimers ranging in size from 450 bp to 6,000 bp.
The products of the self-ligation were then ligated
with plasmid DNA pSY1299 which had been digested with
BanII. The products of this ligation reaction were
transformed into E. coli strain HB101. Transformants
were selected for resistance to chloramphenicol.
Plasmid DNA from individual transformants was purified
and analyzed for increased size due to EBSI multimer
DNA insertions. Ten clones (pSY1240-1249) with inserts
ranging in size from 1.5 Kbp to 4.4 Kbp were obtained.
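
Because the multimers arise by self-ligation of one monomer fragment, their sizes translate directly into whole numbers of repeats. The sketch below makes that conversion for the approximate figures given above (a monomer of about 150 bp, multimers of 450 bp to 6,000 bp, inserts of 1.5 Kbp to 4.4 Kbp). The helper name is hypothetical, and the calculation ignores any flanking vector or linker sequence, so the results are estimates only.

```python
from typing import Tuple


def repeat_range(monomer_bp: int, min_bp: int, max_bp: int) -> Tuple[int, int]:
    """Return the smallest and largest whole number of monomers whose
    combined length lies within [min_bp, max_bp]."""
    if monomer_bp <= 0:
        raise ValueError("monomer length must be positive")
    if min_bp > max_bp:
        raise ValueError("min_bp must not exceed max_bp")
    fewest = -(-min_bp // monomer_bp)  # ceiling division
    most = max_bp // monomer_bp        # floor division
    return fewest, most


if __name__ == "__main__":
    print(repeat_range(150, 450, 6000))   # self-ligation products
    print(repeat_range(150, 1500, 4400))  # inserts recovered in the clones
```

On these assumptions the self-ligation products span roughly 3 to 40 monomers, and the cloned inserts roughly 10 to 29.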

Expression of EBSI Multimer Gene:

One of these clones, pSY1243, which contained
a 4 Kb EBSI multimer gene, was recloned in the
expression vector, pSY751. Plasmid DNA from pSY1243
was digested with NruI and PvuII, separated by agarose
gel electrophoresis, and the DNA band corresponding to
the EBSI multimer gene was excised and purified by NACS
purification. DNA from plasmid pSY751 was digested
with PvuII and ligated with the NruI-PvuII fragment
from pSY1243. The products of this ligation were
transformed into E. coli HB101, and the transformants
selected for resistance to ampicillin. Two clones were
isolated containing the new plasmid pSY1280. E. coli
cells containing pSY1280 were grown at 30°C to an OD600
of 0.7 and then shifted to 42°C for 1.5 hours. The
proteins produced by these cells were analyzed by SDS-
PAGE. The separated proteins were transferred to
nitrocellulose paper and detected by immunoreactivity
with anti-ELP rabbit serum. A strongly reactive
protein band was observed with an apparent molecular
weight of 120 kD.

The ampicillin drug resistance gene of pSY1280
was substituted with the kanamycin marker and the
subsequent plasmid was called pSY1332. This plasmid
was used in fermentation for the purification of EBSI.
(See Methods)



pSY1332/pSY1280 EBSI Protein 1413 AA MW 113,159

MDP V VLQRRD WEN? GVTQL MHL A-^rlP PFASZ3S CMGS
C(GVGVP)3 (GAGAGSGAGAG3) ^
MCY?-\HGYQLSAGHrHYQLVWCQK

Purification of EBSI Protein:

E. coli strain HB101 containing plasmid
pSY1280 was fermented in 10L volume. The cells were
concentrated by filtration and further harvested by
centrifugation. Pelleted cells were stored frozen at
-70°C until processed. Frozen cells were thawed on ice
and suspended in 4 ml of 50 mM Tris-HCl pH 7.0, 10 mM
EDTA, 5 mM PMSF per gram wet weight of cells. The
cells were broken by French pressing twice at 15,000
psi and then cooled to 0°C. The crude lysate was
cleared by centrifugation at 26K x g for 20 minutes. The
supernatant proteins were precipitated by addition of
solid ammonium sulfate to 20% of saturation (114 g/l).
The precipitate was collected by centrifugation at
10K x g for 10 minutes. The pellet was resuspended in 10
ml of H2O and dialyzed against 10 mM Tris pH 8.0, 0.15 M
NaCl at 4°C. The dialyzed solution was digested with
0.1% trypsin (Sigma) for 1.5 hours at room temperature,
and reprecipitated with 20% ammonium sulfate. The
precipitated protein was resuspended in H2O and dialyzed
against 10 mM Tris pH 7.0, 1 mM EDTA at 4°C. The
protein purity of this sample was analyzed by amino acid
composition and determined to be 83%.



Elastic Properties of EBSI Protein:

The soluble preparation of semi-purified EBSI
protein described above was incubated at 37°C for 30
minutes and centrifuged at 10K x g for 10 minutes at room
temperature. This treatment caused the EBSI protein to
aggregate, become insoluble, and pellet into a
translucent solid. The solid was resistant to mechanical
disruption either by vortexing or by maceration using a
glass rod. The solid could be cut with a razor blade
into strips which exhibited a high degree of
elasticity. They fully retained their shape after
repeated extensions and relaxations. They resisted
compression with no apparent irreversible deformation
of structure.

EBSI Purification

EBSI sample (~70% pure) was dialyzed in 50 mM
Tris HCl, 50 mM NaCl, pH 8.0 at 4°C overnight with one
change of buffer. If precipitation was observed, the
sample was centrifuged at 27,000 x g for 15 min at 4°C.
All remaining steps were performed at 4°C. The
supernatant was applied to a DEAE-Sephacel column which had
been equilibrated with 50 mM Tris HCl, 50 mM NaCl, pH
8.0. The flow-through fractions which contained EBSI
were collected and pooled. NaCl was added to the
pooled fractions from the DEAE-Sephacel column to make a
final concentration of 2 M NaCl in the sample.
Insoluble material was removed by centrifugation at
27,000 x g for 20 min. The supernatant was then loaded
onto a Phenyl-Sepharose column which was equilibrated
with 50 mM sodium phosphate buffer, pH 7.0, with 2 M
NaCl. The column was washed extensively with buffer
until no eluting protein was detected by A280. The
column was then eluted stepwise with 50 mM sodium
phosphate buffer, pH 7.0, and finally with water. The
EBSI active fractions were pooled and stored for
further analysis.

With the addition of these steps to the
previous procedures, ^0Z% pure EBSI was obtained.

Example 5

ELPI Construction and Expression

Two oligonucleotide strands were synthesized
and purified as described in the Methods section.



GCGi i. CCCCCaGTaG<j GCTGCCA .> 3
il) • j'-C^-LALLrx^GCCACATCCTCAAGCCCCACArc

AGC<ccrcArcc ccACC<rrTcgA -5'

Saai

■TAGC-CGTT CCC"-<:-G TA G

Seal

ICATCCGCA

The two oligonucleotide strands were annealed
and ligated with the DNA of plasmid pBSm13(+)
(Stratagene) which had been digested with HindIII
and EcoRI.

The products of this ligation reaction were
transformed into E. coli strain JM109. Transformant
colonies were screened for their hybridization
with 32P-labeled oligonucleotide (i). Plasmid DNA from
positively hybridizing clones was purified and
sequenced. One plasmid, pSY1287, contained the
sequence shown for oligonucleotides (i) and (ii).

Plasmid DNA from pSY1287 was digested with
BanI REN and the digestion fragments were separated by
agarose gel electrophoresis. The ELPI gene fragment,
approximately 50 bp, was excised and purified by NACS
column. Approximately 1 ug of purified fragment was
self-ligated in order to produce multimers ranging in
size from 300 bp to 5000 bp.

The products of the self-ligation were then
ligated with plasmid DNA pSY937 which had been digested
with REN BanI. The product of this ligation reaction
was transformed into E. coli strain HB101.
Transformants were selected for resistance to chloramphenicol.
Plasmid DNA from individual transformants was purified
and analyzed for increased size due to ELPI multiple
DNA insertions. Four clones (pSY1388-1391) with
inserts ranging in size from 1.0 kbp to 2.5 kbp were
obtained. These clones were recloned in the λPR
expression vector pSY751. The clones obtained
(pSY1392-1395) were used for expression of ELPI.

The ELPI protein had the following amino acid
composition:

pSY1395 ELPI Protein 859 AA MW 72,555

MD?VVLQERD;^EM?GVTQLMRLAAH???ARMXLAlRrt
[(VPGVG)4]40 VP^rtTRVDLSAGRYHYQLVWCQ-C

SELP1 Gene Construction and Expression

Two oligonucleotide strands were synthesized
and purified as described in the Methods section.

        FspI PvuII SnaBI (PstI)
(i)  5 ' " GraCGCA GCTCg TACGTA G CTGCA -B '

        (PstI) PvuII
(ii) 3 ' -AGGTG i A C G C G 7 G G A 0 G A T G C A T C G - ' -
        FspI SnaBI

These oligonucleotide strands were annealed
and ligated with plasmid pSY1304 which had been
digested with PstI REN (pSY1304 differs from pSY857 by
having a monomeric unit in place of the trimeric unit
of pSY857). Plasmid DNA from transformant colonies
resistant to chloramphenicol was purified. One
plasmid, pSY1365, which was digestible with REN SnaBI,
was sequenced and proven to be correct.

ELPI gene fragment purified as described (ELPI
construction and expression) was treated with Mung Bean
Nuclease as described by the supplier (Stratagene). The
DNA fragments mixture was then ligated with plasmid DNA
pSY1365 which had been digested sequentially with RENs
FspI, SnaBI and calf intestinal phosphatase. The
products of this ligation reaction were transformed into
E. coli strain HB101 and were selected for resistance
to chloramphenicol. Plasmid DNA from individual
transformants was purified and analyzed for the ELPI monomer
DNA insertion. Two plasmids, pSY1366 A and B, were
sequenced. They were both shown to contain the ELPI
DNA sequence in the correct orientation.

Plasmid DNA pSY1365 was digested with REN BanI
and the DNA fragment containing the SELP1 monomer was
gel purified. To create multimers, 1 ug of the SELP1
DNA fragment was self-ligated. Multimers were obtained
ranging in size from 500 bp to 10 kbp. The SELP1
multimers were cloned into the BanI site of pSY1262.
Positive clones were characterized by gel
electrophoresis for the size of the inserted multimer and used
for expression and protein analysis.



pSY1396 SELP1 Protein 2025 AA MW 148,212

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS (GAGAGS) -
[GAA(VPGVG)4 VAAGY (GAGAGS ) g]24
GAA(VPGVG)^ VAAGY (GAGAGS) 3 GAGAMDPGRYQLSAGRYHYQLVWCQK

SELP2 - Monomer Construction

Plasmid DNA pSY1298 was digested with BanII
REN and the EBSI gene fragment was purified as
described previously. The EBSI monomer fragment was
ligated into pSY1304 (pSY937 containing a monomer of
SlpIII, constructed as pSY857) which had been digested
with BanII REN and treated with calf intestinal
phosphatase.

The products of the ligation mixture were
transformed into E. coli strain HB101. Transformants
were selected for resistance to chloramphenicol. After
restriction analysis of several isolates, one plasmid
was chosen, pSY1301, containing a DNA fragment
corresponding to the EBSI monomer gene.

SELP2 - Multimer Gene Assembly and Expression

Plasmid DNA pSY1301 was digested with REN BanI
and the DNA fragment containing the SELP2 "monomer" was
gel purified. To create multimers, 1 ug of the SELP2
DNA fragment was self-ligated. Multimers were obtained
greater than 12 kb in size.

The SELP2 multimers were cloned into the BanI
site of pSY1262. Positive clones were characterized by
gel electrophoresis for the size of the inserted
multimer. The clones with inserts ranging in size from
1.5 to 11 kb were selected. Plasmid DNA pSY1372
containing an insert of 5 kb (13 repeats) was used for
further analysis and protein purification.

SELP2 - Protein Purification

E. coli strain HB101 containing plasmid
pSY1372 was fermented according to the procedure
described in Methods for fermentation. The cells were
harvested by centrifugation. Pelleted cells were
stored frozen at -70°C until processed. Frozen cells
were thawed on ice and suspended in 4 ml of 30 mM Tris-
HCl, pH 7.0, 10 mM EDTA, 5 mM PMSF per gram wet weight
of cells. The cells were broken by passing through a
Gaulin cell disrupter at 3,000 psi. The crude lysate
was cleared by centrifugation at 26,000 x g for 20 min.
The supernatant, which contained >75% of the SELP2
protein, was precipitated by addition of 20% ammonium
sulfate (114 g/L). The precipitate was collected by
centrifugation at 10,000 x g for 10 min. The pellet was
resuspended in 10 ml of H2O and dialyzed against 10 mM
Tris pH 8.0, 0.15 M NaCl at 4°C. The dialyzed material
was centrifuged at 26,000 x g for 15 min in order to
collect the insoluble fraction of protein, which
contained approximately ^0% of the SELP2 protein. This
insoluble protein pellet was washed twice in 0.2% SDS
at 50°C for 30 min with occasional shaking. The
insoluble protein was collected each time by
centrifugation at 25,000 x g for 15 min, followed by a wash of
50% ethanol. The final protein pellet was resuspended
in water and analyzed by Western blot analysis and
amino acid composition. By Western blot the SELP2
protein appears to be homogeneous in size, consistent
with its large molecular weight (>150 kd). By amino
acid composition the SELP2 preparation is approximately
30% pure, and the observed molar ratio of amino acids
(Ser:Gly:Ala:Pro:Val:Tyr) agrees very closely with the
expected composition as predicted from the SELP2
sequence present in pSY1372.

pSY1372 SELP2 Protein 2055 AA MW ^ 52,55-

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS (GAGAGSJ^ (GVGV?;3
:(GAGAGS)5 GAAGY (GAGAGS)^ iGVQV?)f^,r
(GAGAGS)^ GAAGY (GAGAGS)2 GAGAMDPGRYQLSAGRYHYQLVWCQK

SELP3 - Construction and Expression

Plasmid DNA pSY1301 was partially digested
with REN HaeII and the digestion fragments separated by
agarose gel electrophoresis. The larger DNA fragments
were excised and purified by NACS column. The purified
fragments were self-ligated, the ligation reaction was
heated at 70°C for 15' to inactivate the T4 DNA ligase
and eventually digested with REN PstI. The digestion
mixture was then transformed into E. coli strain JM109.
Transformants were selected for resistance to
chloramphenicol. Plasmid DNA from individual transformants
was purified and analyzed for: (1) resistance to REN
PstI; and (2) deletion of the 60 bp HaeII fragment
contained within the SELP2 gene fragment. One clone
(pSY1377) satisfied both requirements. Plasmid DNA
pSY1377 was digested with REN BanI and the DNA fragment
containing the SELP3 monomer was gel purified. To
create multimers, 1 ug of the SELP3 DNA fragment was
self-ligated. Multimers were obtained ranging in size
from 500 bp to 10 kbp. The SELP3 multimers were cloned
into the BanI site of pSY1262. Positive clones were
characterized by gel electrophoresis for the size of
the inserted multimer and used for expression and
protein analysis.

pSY1397 SELP3 Protein 2257 AA MW 168,535

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS (GAGAGS)2
[(GVGVP)8 (GAGAGS)8]24
(GVGVP)8 (GAGAGS)5 GAGAMDPGRYQLSAGRYHYQLVWCQK
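
The stated chain length offers a simple consistency check on the block structure of the SELP3 protein. The sketch below assumes the structure reads as head, (GAGAGS)2, [(GVGVP)8 (GAGAGS)8]24, (GVGVP)8, (GAGAGS)5, tail. The head and tail strings and the repeat counts are a reading of a poorly legible listing and should be treated as assumptions, as should all names in the code. Under that reading the total comes to the 2257 residues given in the header line.

```python
# Flanking sequences as read from the listing (assumed, not verified).
HEAD = "MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS"
TAIL = "GAGAMDPGRYQLSAGRYHYQLVWCQK"

ELASTIN_BLOCK = "GVGVP"   # elastin-like repeat
SILK_BLOCK = "GAGAGS"     # silk-like repeat


def selp3_sequence() -> str:
    """Assemble the assumed SELP3 sequence from its repeating blocks."""
    core_unit = ELASTIN_BLOCK * 8 + SILK_BLOCK * 8
    return (
        HEAD
        + SILK_BLOCK * 2
        + core_unit * 24
        + ELASTIN_BLOCK * 8
        + SILK_BLOCK * 5
        + TAIL
    )


if __name__ == "__main__":
    seq = selp3_sequence()
    print(len(seq))  # compare with the "2257 AA" stated in the header
```

Agreement in length does not prove the reading is correct, since other combinations of repeat counts could give the same total; it only shows that this one is self-consistent.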

SLP4 - Construction and Expression

Plasmid DNA pSY1304 (pSY857 with a single
monomeric unit as distinct from the trimeric unit of
pSY857) was partially digested with REN HaeII and the
digestion fragments separated by agarose gel
electrophoresis. The larger DNA fragments were excised
and purified by NACS column. The purified fragments
were self-ligated, the ligation reaction was heated at
70°C for 15' to inactivate the T4 DNA ligase and
eventually digested with REN PstI. The digestion
mixture was then transformed into E. coli strain
JM109. Transformants were selected for resistance to
chloramphenicol. Plasmid DNA from individual
transformants was purified and analyzed for: (1)
resistance to REN PstI; and (2) deletion of the 60 bp HaeII
fragment contained within the SELP2 gene fragment. One
clone (pSY1373) satisfied both requirements. Plasmid
DNA pSY1373 was digested with REN BanI and the DNA
fragment containing the SLP4 monomer was gel purified.
To create multimers, 1 ug of SLP4 DNA was self-ligated.
Multimers were obtained ranging in size from 300 bp to
5 kbp. The SLP4 multimers were cloned into the BanI
site of pSY1262. Positive clones were characterized by
gel electrophoresis for the size of the inserted
multimer and used for expression and protein analysis.

pSY1398 SLP4 Protein 1101 AA MW 76,231

MDPVVLQRRDWENPGVTQLNRLAAHPPFASDPMGAGS [ ; GA'GA G3 ) 5 ]
(GAGAGS; GAGAMDPGRYQLSAGRYHYQLVWCQK

As is evident from the above results, highly
repetitive sequences can be prepared, cloned, and used
for expression to produce a wide variety of products
which may mimic natural products, such as silk and
other proteins and antigens. In addition, novel
systems are provided for controlling the expression of
the peptide under inducible conditions in a variety of
hosts. In this manner, new proteinaceous products can
be provided which provide for new properties or may
closely mimic the properties of naturally occurring
products.

All publications and patent applications
mentioned in this specification are indicative of the
level of skill of those skilled in the art to which
this invention pertains. All publications and patent
applications are herein incorporated by reference to
the same extent as if each individual publication or
patent application was specifically and individually
indicated to be incorporated by reference.

The invention now being fully described, it
will be apparent to one of ordinary skill in the art
that many changes and modifications can be made thereto
without departing from the spirit or scope of the
appended claims.

WHAT IS CLAIMED IS:

1. A DNA sequence encoding a peptide
containing an oligopeptide repeating unit, which
repeating unit is characterized by containing at least
three different amino acids and a total of from 4 to 30
amino acids, there being at least two repeating units
in said peptide and at least two identical amino acids
in each repeating unit and wherein said units are
optionally joined by an amino acid bridge of from about
1 to 15 amino acids, said DNA sequence having the
following formula:

$K_k (W M X_x N Y_y)_i L_l$

wherein:

K and L are each DNA sequences encoding an
amino acid sequence of from about 1 to 100 amino acids,
K and L being fewer than about 20 number % of the total
amino acids;

k and l are 0 or 1;

W is of the formula:

$[(A)_n (B)_p]_q$

A is a DNA sequence coding for said
oligopeptide repeating unit where A will contain from
about 12 to 90nt, at least two codons coding for said
identical amino acid in said repeating units being
different, where there will be at least two different
A's differing by at least one nucleotide;

B is a DNA sequence different from A coding
for other than the oligopeptide unit coded by the A
unit and having from about 3 to 45nt, where the B units
may be the same or different;

n is an integer in the range of 1 to 100;

each p is independently 0 or 1; and

q is at least 1 and is selected so as to
provide a DNA sequence of at least 90nt;

M and N are the same or different and are a
DNA sequence of codons and are of 0 to 18nt in reading
frame with W and X respectively;

X is the same as or different from W and is of
the formula:

$[(A^1)_{n^1} (B^1)_{p^1}]_{q^1}$

Y is the same as or different from W and is of
the formula:

$[(A^2)_{n^2} (B^2)_{p^2}]_{q^2}$

wherein:

all of the symbols come within the definitions
of their letter counterparts;

x and y are 0 or 1;

i is 1 to 100; and

the total of q, q^1, and q^2 is at least 1 and
not greater than about 50.



2. A DNA sequence according to Claim 1,
wherein said DNA sequence is not greater than about
10knt.

3. A DNA sequence according to Claim 2,
wherein M and N are 0nt and at least one of x and y is
0.



4. A DNA sequence encoding a peptide
containing an oligopeptide repeating unit, which
repeating unit is characterized by containing at least
three different amino acids and a total of from 4 to 30
amino acids, there being at least two repeating units
in said peptide and at least two identical amino acids
in each repeating unit and wherein said units are
optionally joined by an amino acid bridge of from about
1 to 15 amino acids, said DNA sequence having the
following formula:

$K_k [(A)_n (B)_p]_q L_l$

wherein:

K and L are each DNA sequences encoding an
amino acid sequence of from about 1 to 100 amino acids,
K and L being fewer than about 20 number % of the total
amino acids;

k and l are 0 or 1;

A is a DNA sequence coding for said
oligopeptide repeating unit where A will contain from
about 12 to 90nt, at least two codons coding for said
identical amino acid in said repeating units being
different, where there will be at least two different
A's differing by at least one nucleotide;

B is a DNA sequence different from A coding
for other than the oligopeptide unit coded by the A
unit and having from about 3 to 45nt, where the B units
may be the same or different;

n is an integer in the range of 1 to 100;

each p is independently 0 or 1; and

q is at least 1 and is selected so as to
provide a DNA sequence of at least 90nt.

5. A DNA sequence according to Claim 4,
wherein said A codes for the same amino acid at least
three times with at least two different codons.

6. A DNA sequence according to Claim 4,
wherein said same amino acid is glycine.



WO 88/03533    PCT/US87/02822



7. A DNA sequence encoding a peptide
containing an oligopeptide repeating unit, which
repeating unit is characterized by comprising the amino
acid sequence GAGAGS or GVGVP, said DNA sequence
comprising the following formula:

$[(A)_n (B)_p]_q$

wherein:

A is a DNA sequence coding for said
oligopeptide repeating unit, wherein the codons for at
least two G's in the same or different A's are
different, where there will be at least two different
A's differing by at least one nucleotide;

B is a DNA sequence different from A coding
for other than the oligopeptide unit coded by the A
unit and having from about 3 to 45nt, where the B units
may be the same or different;

n is an integer in the range of 5 to 25;

each p is independently 0 or 1; and

q is at least 1 and is selected so as to
provide a DNA sequence of at least 90nt.
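
For orientation only, the numerical limits recited in Claim 7 can be expressed as a small checker. The sketch below is an illustration and not a statement of claim scope: the first A unit is the sequence quoted in Claim 9, while `A_VARIANT` and `B_EXAMPLE` are hypothetical sequences invented for the example. It also simplifies the formula by using a single value of p for every block, whereas the claim lets each p vary independently.

```python
from typing import List, Sequence

A_CLAIM_9 = "GGTGCGGGCGCAGGAAGT"   # A unit quoted in Claim 9
A_VARIANT = "GGCGCTGGTGCAGGTTCT"   # hypothetical second A unit (same peptide)
B_EXAMPLE = "GCTGGTTAC"            # hypothetical B unit ending in a Tyr codon


def assemble(a_units: Sequence[str], b_unit: str, p: int, q: int) -> str:
    """Concatenate [(A)n (B)p]q, where n is the number of A units given."""
    block = "".join(a_units) + (b_unit if p == 1 else "")
    return block * q


def check_limits(a_units: Sequence[str], b_unit: str, p: int, q: int) -> List[str]:
    """Return the limits that are violated; an empty list means none."""
    problems: List[str] = []
    if not 5 <= len(a_units) <= 25:
        problems.append("n must be in the range 5 to 25")
    if len(set(a_units)) < 2:
        problems.append("at least two A units must differ")
    if p not in (0, 1):
        problems.append("p must be 0 or 1")
    if p == 1 and not 3 <= len(b_unit) <= 45:
        problems.append("B must have about 3 to 45 nt")
    if q < 1:
        problems.append("q must be at least 1")
    if len(assemble(a_units, b_unit, p, q)) < 90:
        problems.append("total length must be at least 90 nt")
    return problems


if __name__ == "__main__":
    units = [A_CLAIM_9, A_VARIANT, A_CLAIM_9, A_VARIANT, A_CLAIM_9]
    print(check_limits(units, B_EXAMPLE, p=1, q=1))
```

Five 18-nt A units already reach the 90-nt minimum with q equal to 1, which is why the lower bound on n and the length requirement tend to be satisfied together.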

8. A DNA sequence according to Claim 7,
wherein said oligopeptide is GAGAGS and B comprises a
codon for tyrosine.

9. A DNA sequence according to Claim 7,
wherein an A is of the formula:

GGTGCGGGCGCAGGAAGT.



10. A DNA sequence according to Claim 8,
wherein an A comprises the sequence:

GGT GCC GGC AGC GGT GCA GGA GGC GGT TCT GGA GCT GGC GGG
GGC TCT GGC GCG GGG GGA GGA.

11. A DNA sequence according to Claim 7,
wherein said oligopeptide is GVGVP.

12. A DNA sequence according to Claim 11,
wherein one A is of the formula:

GGTGTAGGCGTTCCG.
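
The two short A sequences recited in Claims 9 and 12 can be checked against the peptides they are said to encode, and against the requirement that repeated glycines use different codons. The sketch below uses a partial codon table drawn from the standard genetic code, limited to the five residues involved; the function names are illustrative assumptions.

```python
from typing import Set

# Partial standard codon table: Gly, Ala, Ser, Val and Pro only.
CODON_TABLE = {
    "GGT": "G", "GGC": "G", "GGA": "G", "GGG": "G",
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",
    "TCT": "S", "TCC": "S", "TCA": "S", "TCG": "S",
    "AGT": "S", "AGC": "S",
    "GTT": "V", "GTC": "V", "GTA": "V", "GTG": "V",
    "CCT": "P", "CCC": "P", "CCA": "P", "CCG": "P",
}


def split_codons(dna: str) -> list:
    """Split an in-frame DNA string into a list of codons."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def translate(dna: str) -> str:
    """Translate DNA to one-letter amino acids using the partial table."""
    return "".join(CODON_TABLE[codon] for codon in split_codons(dna))


def codons_used(dna: str, residue: str) -> Set[str]:
    """Return the distinct codons that encode `residue` in `dna`."""
    return {c for c in split_codons(dna) if CODON_TABLE[c] == residue}


if __name__ == "__main__":
    for a_unit in ("GGTGCGGGCGCAGGAAGT", "GGTGTAGGCGTTCCG"):
        print(translate(a_unit), sorted(codons_used(a_unit, "G")))
```

A codon outside the partial table raises a `KeyError`, which is intentional here: the sketch is meant for these repeat units and not for arbitrary sequences.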



13. A DNA sequence according to Claim 1,
wherein WMX comprises the sequence:

GGA GTA GGT GTG CCA GGT GTA
CCG GGA GTT GGT GTA CCT GGA
GTG CCC GGG GTA GGA GTA CCA
GGT GCT GGT AGC GGC GCA GGC

GGA GTT GGG GGT GTA GGC GTT
GTG GGT GTT CCA GGC GTA GGT
GGG GTA GGC GTC CCT GGA GCG
GGG GGC TCC.



14. A polypeptide comprising the recombinant
DNA expression product of a DNA sequence encoding a
peptide having repeating units of an oligopeptide,
which oligopeptide is characterized by having at least
three different amino acids and a total of from 4 to 30
amino acids, there being at least two repeating units
in said peptide and at least two identical amino acids
in each repeating unit and wherein said units are
optionally joined by an amino acid bridge of from about
1 to 15 amino acids, said DNA sequence having the
following formula:

$K_k W L_l$

wherein:

K and L are each DNA sequences encoding an
amino acid sequence of from about 1 to 100 amino acids,
K and L being fewer than about 20 number % of the total
amino acids;

k and l are 0 or 1;

W is of the formula:

$[(A)_n (B)_p]_q$

A is a DNA sequence coding for said
oligopeptide repeating unit where A will contain from
about 12 to 90nt, at least two codons coding for said
identical amino acids in said repeating unit being
different, where there will be at least two different
A's differing by at least one nucleotide;

B is a DNA sequence different from A coding
for other than the oligopeptide unit coded by the A
unit and having from about 3 to 45nt, where the B units
may be the same or different;

n is an integer in the range of 1 to 100;

each p is independently 0 or 1; and

q is at least 1 and is selected so as to
provide a DNA sequence of at least 90nt.

15. A polypeptide according to Claim 14,
wherein said polypeptide will include as a repeating
unit at least one of GAGAGS and GVGVP.



30 16. A peptide comprising the recombinant DNA 

expression product of a DNA sequence encoding a peptide 
having repeating units of an oligopeptide, which 
oligopeptide is characterized by having at least three 
different amino acids and a total of from 4 to 30 amino 

35 acids, there being at least two repeating units in said 
peptide and at least two identical amino acids in each 
repeating unit and wherein said units are ap::iQnally 
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Joined by an arninc acid bridge of from about 1 to 1 5 
amine acids, said DMA sequence having the following 
f o rmula : 

5 ^k^— 'n ^^^p-c^l 

wherein: 

K and L are each DMA sequences encoding an 
amino acid sequence of from about ■ to :00 amino acids, 
'0 K and L being fewer than about 20 number % of the total 
amino acids; 

k and 1 are 0 or 1 ; 

A is a D:IA sequence coding for saib 
cligopeptide repeating unit where A will contain from 
o about 12 to"90nt, at least two codons coding for said 
identical amino acids in said repeating unit being 
different, where there will be at least two different 
A'3 differing by at least one nucleotide; 

3 is a DMA sequence different from A coding 
20 for other than the oligopeptide unit coded by the A 

unit and having from about 3 to U5nt, where the 3 units 
may be the same or different; 

n is an integer in the range of 1 to 100;

each p is independently 0 or 1; and

q is at least 1 and is selected so as to
provide a DNA sequence of at least 90nt.
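As an illustration only, the formula and numeric limits recited above can be read as a small assemble-and-check routine. The following Python sketch is not part of the claim. The names `build_construct` and `check_limits` and the example sequences are hypothetical. It assumes one value of p for all q blocks (the claim allows each p to be chosen independently), it measures the 90nt minimum on the repeat region only, and it does not check the 20 number % limit on K and L.

```python
def build_construct(a_units, b_unit="", p=0, q=1, k_seq="", l_seq=""):
    """Concatenate K_k [(A)_n (B)_p]_q L_l, with n = len(a_units).

    An empty k_seq or l_seq corresponds to k = 0 or l = 0.
    Assumption: the same p is applied to every one of the q blocks.
    """
    block = "".join(a_units) + (b_unit if p == 1 else "")
    return k_seq + block * q + l_seq


def check_limits(a_units, b_unit="", p=0, q=1):
    """Return the recited limits that the given inputs fail to meet."""
    problems = []
    if not 1 <= len(a_units) <= 100:
        problems.append("n must be in the range 1 to 100")
    if any(not 12 <= len(a) <= 90 for a in a_units):
        problems.append("each A must be about 12 to 90 nt")
    if len(set(a_units)) < 2:
        problems.append("at least two different A units are required")
    if p not in (0, 1):
        problems.append("p must be 0 or 1")
    if p == 1 and not 3 <= len(b_unit) <= 45:
        problems.append("B must be about 3 to 45 nt")
    if q < 1:
        problems.append("q must be at least 1")
    # Assumption: the 90 nt minimum is measured on the repeat region alone.
    if len(build_construct(a_units, b_unit, p, q)) < 90:
        problems.append("repeat region must be at least 90 nt")
    return problems


# Two hypothetical 18 nt A units that differ in nucleotide sequence.
a_units = ["GGTGCTGGCGCAGGATCT", "GGCGCAGGTGCTGGAAGC"]
dna = build_construct(a_units, q=3)
print(len(dna))                    # 108
print(check_limits(a_units, q=3))  # []
print(check_limits(a_units, q=2))  # ['repeat region must be at least 90 nt']
```

With q = 2 the same two units give only 72 nt, which is why q is described as being selected to reach the minimum length.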

17. A polypeptide according to Claim 16,
wherein A codes for GAGAGS or GVGVP.
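The requirement that identical amino acids within a repeating unit be encoded by different codons can be illustrated with the two repeating units named here. The following Python sketch uses a partial table of the standard genetic code covering only glycine, alanine, serine, valine and proline. The example DNA sequences and the helper names `translate` and `varies_codons` are hypothetical and are not taken from the specification.

```python
from collections import defaultdict

# Partial standard genetic code: only the residues found in GAGAGS and GVGVP.
CODON_TABLE = {
    **{c: "G" for c in ("GGT", "GGC", "GGA", "GGG")},
    **{c: "A" for c in ("GCT", "GCC", "GCA", "GCG")},
    **{c: "S" for c in ("TCT", "TCC", "TCA", "TCG", "AGT", "AGC")},
    **{c: "V" for c in ("GTT", "GTC", "GTA", "GTG")},
    **{c: "P" for c in ("CCT", "CCC", "CCA", "CCG")},
}


def codons(dna):
    """Split a DNA string into consecutive triplets."""
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def translate(dna):
    """Translate DNA to one-letter amino acids using the partial table."""
    return "".join(CODON_TABLE[c] for c in codons(dna))


def varies_codons(dna):
    """True if some amino acid occurring more than once in the unit
    is encoded by at least two different codons."""
    used = defaultdict(list)
    for c in codons(dna):
        used[CODON_TABLE[c]].append(c)
    return any(len(cs) > 1 and len(set(cs)) > 1 for cs in used.values())


print(translate("GGTGCTGGCGCAGGATCT"))      # GAGAGS
print(varies_codons("GGTGCTGGCGCAGGATCT"))  # True
print(translate("GGTGTTGGCGTACCG"))         # GVGVP
print(varies_codons("GGTGCTGGTGCTGGTTCT"))  # False: every Gly is GGT, every Ala is GCT
```

The last sequence also translates to GAGAGS but reuses a single codon per residue, so it would not show the codon variation described for the A unit.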


18. A regulatory system for regulated
transcription in a prokaryotic host comprising:

(1) a DNA sequence which comprises a
structural gene encoding an RNA polymerase exogenous to
said host which is capable of transcribing DNA to
messenger RNA and which is under the transcriptional
control of an inducible promoter; and

(2) a structural gene encoding a
polypeptide of interest under the transcriptional
control of a promoter which is not functional with the
endogenous RNA polymerase of said host, but is
functional with said exogenous RNA polymerase.
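The two-part arrangement can be summarized as a logical dependency: the gene for the polypeptide of interest is transcribed only when the exogenous polymerase is present, and that polymerase is made only when its inducible promoter is switched on. The Python sketch below is a deliberately idealized truth-table model. The function and parameter names are hypothetical, and it ignores basal (uninduced) promoter activity.

```python
def polypeptide_transcribed(inducer_present,
                            polymerase_gene_present=True,
                            target_gene_present=True):
    """Idealized on/off model of the two-component regulatory system."""
    # Element (1): the exogenous polymerase is produced only under induction.
    exogenous_polymerase_made = polymerase_gene_present and inducer_present
    # Element (2): the target promoter is not recognized by the host
    # polymerase, so transcription needs the exogenous polymerase.
    return target_gene_present and exogenous_polymerase_made


print(polypeptide_transcribed(inducer_present=False))  # False
print(polypeptide_transcribed(inducer_present=True))   # True
print(polypeptide_transcribed(True, polymerase_gene_present=False))  # False
```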

19. A regulatory system according to Claim 18,
wherein said inducible promoter is the beta-galactosidase
promoter or the spoVG promoter.






20. A prokaryotic host comprising a DNA
sequence according to Claim 1.

21. A method for producing a peptide
comprising the recombinant DNA expression product of a
DNA sequence encoding a peptide having repeating units
of an oligopeptide, which oligopeptide is characterized
by having at least three different amino acids and a
total of from 4 to 30 amino acids, there being at least
two repeating units in said peptide and at least two
identical amino acids in each repeating unit and
wherein said units are optionally joined by an amino
acid bridge of from about 1 to 15 amino acids, said DNA
sequence having the following formula:

wherein:

K and L are each DNA sequences encoding an
amino acid sequence of from about 1 to 100 amino acids,
K and L being fewer than about 20 number % of the total
amino acids;

k and l are 0 or 1;

W is of the formula: 

[(A)_n (B)_p]_q

wherein:

A is a DNA sequence coding for said
oligopeptide repeating unit where A will contain from
about 12 to 90nt, at least two codons coding for said
identical amino acids in said repeating unit being
different, where there will be at least two different
A's differing by at least one nucleotide;

B is a DNA sequence different from A coding
for other than the oligopeptide unit coded by the A
unit and having from about 3 to 45nt, where the B units
may be the same or different;

n is an integer in the range of 1 to 100;

each p is independently 0 or 1; and

q is at least 1 and is selected so as to
provide a DNA sequence of at least 90nt;
M and N are the same or different and are a
DNA sequence of codons and are of 0 to 18nt in reading
frame with W and X respectively, encoding an amino acid
sequence;

X is the same as or different from W and is of
the formula:

[(A^1)_{n^1} (B^1)_{p^1}]_{q^1}

Y is the same as or different from W and is of
the formula:

[(A^2)_{n^2} (B^2)_{p^2}]_{q^2}

wherein:

all of the symbols come within the definitions
of their letter counterparts;

x and y are 0 or 1;

i is 1 to 100; and

the total of q, q^1 and q^2 is at least 1 and
not greater than about 50;

said method comprising: 




growing a prokaryotic host according to
Claim 20,

wherein said DNA sequence is under the
transcriptional and translational regulation of
initiation and termination regulatory regions
functional in said host, whereby said peptide is
expressed; and

isolating the expression product. 

22. A method according to Claim 21, wherein
said peptide comprises at least one of the repeating
units GAGAGS and GVGVP.
[Figure: plasmid construction scheme. Recoverable labels: a digestion step; (4) insertion of an adaptor sequence carrying multiple restriction sites, including BamHI, BanI, StyI, AvaI, EcoRV and HindIII; step 5, a digest; step 6, a purification; step 7, illegible. The adaptor nucleotide sequence and remaining labels are not recoverable from the scan.]